Introduction to the course


Welcome to The Quantum World! The books in this series, of which this is the 
first, provide a wide-ranging survey of the most fundamental physical principles 
that govern the way the world works. It has long been recognized that those 
principles are rooted in quantum physics, and it is therefore essentially a quantum 
world in which we live and a quantum world that these books describe. 


Quantum physics was born in 1900, when the German physicist Max Planck
used the term ‘quantum’ to describe the minimum amount of energy that could be
exchanged between matter and radiation at a specified frequency. Over the
following 25 years or so, Planck’s original idea was developed and elaborated by
Albert Einstein, Niels Bohr, Arnold Sommerfeld and many other physicists.
These pioneers tackled a variety of problems involving the interaction of matter
and radiation including, most notably, but not entirely successfully, the absorption
and emission of light by atoms. The subject they created, based partly on ad-hoc
rules and involving an uncomfortable fusion of classical and quantum concepts,
is now referred to as the old quantum theory and is generally regarded as
representing humanity’s first faltering steps towards a new kind of physics.

All emboldened terms are defined in the Course Glossary.


The need for something more systematic was recognized by Max Born, who 
coined the term quantum mechanics in 1924. Born saw, perhaps more clearly 
than any of his contemporaries, that what was needed was an entirely new 
physics, comparable to, yet different from, the preceding classical physics
in which he and his fellow physicists had been trained. Just as the classical
mechanics of Newton provides one of the main pillars of classical physics, so 
would quantum mechanics provide a foundation for the new coherent subject of 
quantum physics that was to come. 


Fittingly, it was Born’s brilliant young research assistant, Werner Heisenberg, who 
made the key breakthrough that ended the era of ad-hoc quantum rules and 
highlighted the gulf that would forever separate quantum mechanics from its 
classical counterpart. The details of Heisenberg’s breakthrough need not detain us 
here, but we should note that the complexities of his analysis were soon eased by 
the introduction of a simpler and more transparent approach pioneered by the 
Austrian physicist Erwin Schrödinger. It is this latter approach to quantum
mechanics, based on waves and therefore known as wave mechanics, that is the 
subject of this book. 


Chapter 1 will introduce you to some of the phenomena that caused so much 
perplexity in the opening decades of the twentieth century. You will also have 
your first glimpse of the probabilistic ideas of quantum physics that eventually 
cleared away the confusion. These ideas are studied more systematically in 
Chapter 2, which introduces Schrödinger’s equation, the fundamental equation of
wave mechanics. Chapter 3 then goes on to apply Schrödinger’s equation to some
simple situations involving particles in microscopic boxes. 


Chapter 4 introduces Heisenberg’s uncertainty principle, which is one of the most 
celebrated principles of quantum physics since it so clearly distinguishes quantum 
mechanics from classical mechanics. In the course of doing this, the chapter
will show you how to predict the average value, and the spread in values, of
measurable quantities in wave mechanics. Chapter 5 uses these ideas to study the
quantum analogue of a classical simple harmonic oscillator, exemplified by the
vibrational energies of atoms in diatomic molecules. Up to this point, the focus is
on concepts like energy and momentum, but without any direct description of 
the motion of a particle from one point in space to another. This omission is 
remedied in Chapter 6 which analyzes the motion of particles in terms of wave 
packets and explores the circumstances under which the predictions of quantum 
physics become practically the same as those of classical physics. This is a 
significant point, since quantum physics claims to be universally valid, applying 
to large-scale systems as well as to atoms and nuclei. 


The extraordinary quantum phenomena that occur when particles are scattered by
obstacles are one of the main subjects of Chapter 7. This chapter brings to an end
the physical discussion in the book, concluding on a high note by describing the 
phenomenon of quantum tunnelling and examining its significance in nuclear 
decays, stellar astrophysics and the working of the scanning tunnelling 
microscope. There is, however, one additional chapter; this is devoted to 
mathematical topics, particularly complex numbers, differential equations and 
probability, and is designed to be studied, whenever appropriate, alongside the 
other chapters. Don’t forget this final chapter, which constitutes a Mathematical 
toolkit, and don’t make the mistake of leaving it until the end! 


Although this book covers the basic principles of wave mechanics there are many 
aspects of the quantum world that it does not address. Some of the deeper aspects 
of quantum mechanics, especially those involving multi-particle systems, angular 
momentum and the fascinating phenomenon of entanglement, are explored in the 
second book in the series, while the applications of quantum mechanics to atoms, 
molecules and solids are the theme of the third book. 


In the second half of the twentieth century, quantum mechanics became 
increasingly important for technology — in semiconductors and lasers, for 
example. The British Parliament was told that ‘something like 25% of the GDP of 
the USA is based on physics that is totally derived from quantum mechanics 
research’. Many physicists have expressed the view that the twenty-first century 
will be the century of quantum technology and quantum devices. If so, it is
the principles contained in these three books that will provide the essential
background needed to understand these devices and, perhaps, to participate in 
their development. 


Chapter 1 The quantum revolution


Introduction 


This chapter gives an overview of some of the ways in which our understanding of 
the world has been changed by quantum mechanics. In the process of doing this, 
it will remind you of some basic physics ideas that are needed throughout the 
course and allow you to review the mathematics of complex numbers. 


The chapter is organized as follows. Section 1 starts with the observation that 
many systems emit light with characteristic patterns of spectral lines. This is 
related to the fact that the energy levels of these systems are quantized (or discrete).
Section 2 introduces one of the most pervasive features of quantum mechanics — 
the fact that it deals with probabilities rather than certainties. This is illustrated by 
radioactive decay and by commercial devices that exploit quantum mechanics to 
produce random numbers. Sections 3 and 4 discuss the perplexing fact that 
photons and electrons behave in some ways like particles, and in other ways like 
waves. In the course of describing electron waves, we introduce some concepts 
that will be needed later in the course, including de Broglie’s relationship and 
wave functions. Section 5 takes a closer look at the quantum-mechanical 
principles that lie behind the phenomenon of interference and, finally, Section 6 
reflects on the quantum mysteries that lie ahead. 


1.1 The quantization of energy


In the 1830s the philosopher Auguste Comte gave an example of something that 
he believed would forever be beyond human knowledge: the composition of
the stars. Evidently he was unaware that in 1813 Joseph von Fraunhofer had
found spectral lines in light from the Sun. Analyzing heavenly substances 
became possible in the 1860s when Gustav Kirchhoff and Robert Bunsen used a 
spectroscope to examine the colours emitted by chemical elements heated in 
flames. They found that the patterns of spectral lines that appear provide a unique 
signature for each element — its spectrum (Figure 1.1). 


Figure 1.1 The spectra of some common elements. Top, helium; centre, iron;
bottom, neon.


The same patterns of spectral lines are found in stars. So Comte could not have
been more wrong: thanks to spectral analysis, we now know in considerable detail
what stars are made of. Much has also been learned about the history of the
Universe by studying the abundances of the elements in stars. The spectral lines
produced by atoms under given conditions are the same wherever those atoms are
in the Universe and they have not changed for billions of years. But how do
atoms, indestructible enough to have survived unchanged for billions of years,
produce spectral lines?

Figure 1.2 Niels Bohr (1885-1961). Bohr accounted for the spectrum of atomic
hydrogen by applying simple quantum rules to determine the energy levels of a
hydrogen atom. He received the Nobel prize for physics in 1922.

$\hbar$ is pronounced ‘h-bar’.


The first steps towards an explanation were taken by the Danish physicist Niels 
Bohr (Figure 1.2), who gave a semi-quantum explanation of the spectrum of the 
simplest atom, hydrogen. A major theme of this course will be the success of 
quantum mechanics in explaining the spectra of nuclei, atoms, molecules etc., 
going far beyond what Bohr achieved. 


1.1.1 Energy levels 


One of Bohr’s key discoveries was that atoms possess discrete energy levels; their 
energies are said to be quantized. As a result, atoms behave differently from, say, 
red-hot pokers. The temperature of a cooling poker appears to fall continuously as 
energy is lost to the environment. An atom, however, is characterized by a series 
of discrete energy levels, so it cannot lose energy continuously. Instead, it
jumps from one energy level to another of lower energy, emitting a packet of
electromagnetic radiation, a photon, as it does so. After a short time it may emit 
another photon, jumping to a state of still lower energy. This continues until the 
atom reaches its state of lowest energy, the ground state. Each photon is a packet 
of electromagnetic radiation with a frequency determined by the energy of the 
photon. If an atom jumps from a state of energy $E_j$ to a state of lower energy $E_i$,
then the energy of the emitted photon, $E_{\text{photon}}$, and the corresponding frequency
of the radiation, call it $f_{ji}$, are determined by the change in energy of the atom:

$$E_{\text{photon}} = h f_{ji} = E_j - E_i, \qquad (1.1)$$

where $h = 6.63 \times 10^{-34}$ J s is Planck’s constant, the fundamental constant that
characterizes most quantum phenomena. The emitted radiation can also be
described in terms of the angular frequency $\omega_{ji} = 2\pi f_{ji}$, and the energy of the
photon is then:

$$E_{\text{photon}} = \hbar\omega_{ji} = E_j - E_i, \qquad (1.2)$$

where $\hbar = h/2\pi = 1.06 \times 10^{-34}$ J s.


Each kind of atom has its own characteristic series of energy levels, $E_1, E_2,
\ldots, E_i, \ldots, E_j, \ldots$, with $E_1$ corresponding to the ground state. An atom that has
been excited to a high energy, maybe in a flame or on the surface of a star,
will lose energy by emitting radiation at a characteristic set of frequencies
$f_{ji} = |E_j - E_i|/h$. These frequencies account for the spectral lines and provide
the atom’s unique spectral signature, anywhere in the Universe. 
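As a rough numerical illustration of Equations 1.1 and 1.2, the short Python sketch below converts an energy-level difference into a photon frequency and an angular frequency. The function name `photon_frequency` and the 2.0 eV gap are assumptions chosen for illustration and do not come from the text; the electronvolt is taken as 1.60 × 10^-19 J.

```python
import math

H = 6.63e-34               # Planck's constant in J s (value quoted in the text)
HBAR = H / (2 * math.pi)   # h-bar in J s
EV = 1.60e-19              # one electronvolt in joules

def photon_frequency(e_upper_ev, e_lower_ev):
    """Return (f, omega) for a jump between two levels given in eV."""
    delta_e = abs(e_upper_ev - e_lower_ev) * EV   # photon energy in joules
    f = delta_e / H                               # Equation 1.1
    omega = delta_e / HBAR                        # Equation 1.2
    return f, omega

# Hypothetical 2.0 eV gap, chosen only as an example.
f, omega = photon_frequency(2.0, 0.0)
print(f"f = {f:.3e} Hz, omega = {omega:.3e} rad/s")
```

A gap of a few electronvolts gives a frequency of order 10^14 to 10^15 Hz, which is the visible and near-visible part of the spectrum.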


In this course you will see how the energy levels of atoms can be calculated using 
the principles of quantum mechanics. Not just atoms but also molecules, atomic 
nuclei and the fundamental constituents of nuclei have states with specific 
energies (Figure 1.3). 


Figure 1.3 Comparing the rotational energy levels of a hydrogen chloride
molecule (HCl), the electronic energy levels in a hydrogen atom (H), and the
nuclear energy levels of a mercury isotope (Hg). The region above the hydrogen
levels is labelled ‘continuum states of ionized atom’.


Each horizontal line in Figure 1.3 represents the energy of a particular state of
the system above the ground state, which has been assigned zero energy in
each case. The energy scales are very different. The most convenient unit of
measurement for electronic energy levels in atoms is the electronvolt (eV), with
1 eV $= 1.60 \times 10^{-19}$ J. Rotational energies of molecules are best measured in
millielectronvolts (1 meV $= 10^{-3}$ eV) and nuclear energies are measured in
megaelectronvolts (1 MeV $= 10^{6}$ eV). The factor of a million or so between
atomic and nuclear energies lies behind the fact that electricity generators can get 
roughly a million times more energy from a lump of uranium than from the same 
mass of coal. Not all energy levels are shown: for the HCl molecule and the Hg 
nucleus there are countless energy levels above the top of the scale. The hydrogen 
atom also has indefinitely many energy levels, but crowded together below the 
limiting energy of 13.6 eV, above which energy the atom loses its electron and is 
said to be ionized. All these energy levels are known from a study of the photons 
emitted when the molecule, atom or nucleus jumps from one energy level to 
another. 
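To make the difference between these energy scales concrete, the following Python sketch converts energies in meV, eV and MeV to joules. The helper `to_joules` is a hypothetical name introduced here for illustration; only the conversion factors come from the text.

```python
EV = 1.60e-19  # joules per electronvolt (value quoted in the text)

def to_joules(value, unit):
    """Convert an energy given in meV, eV or MeV to joules."""
    factors = {"meV": 1e-3, "eV": 1.0, "MeV": 1e6}
    return value * factors[unit] * EV

# One representative energy on each of the three scales of Figure 1.3.
for value, unit in [(1, "meV"), (13.6, "eV"), (1, "MeV")]:
    print(f"{value} {unit} = {to_joules(value, unit):.3e} J")
```

The three results span nine orders of magnitude, which is why a different unit is convenient for each kind of system.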


Exercise 1.1 Atomic nuclei, like atoms, have discrete energy levels. A
nucleus jumps between two energy levels 1 MeV apart emitting a photon; a 
photon with such an energy is called a gamma-ray photon. What is the ratio of the 
frequency of a 1 MeV gamma-ray photon to a 1 MHz frequency (corresponding to 
a photon in the AM radio band)? 


Exercise 1.2 Do you think that a red-hot poker really cools down 
continuously, or does it, too, cool in jumps if viewed closely enough?


If atoms, molecules, etc. have a ground state of lowest energy, do they also have a
state of maximum energy? Can their energy be increased without limit? What 
happens when you pump more and more energy into an atom by bombarding it 
with photons or by allowing it to collide with other energetic atoms? Such 
collisions certainly occur in a neon tube, or in a star, for example. The answer is 
that above a certain energy the atom becomes ionized, that is to say, an electron is 
ejected. The positively-charged remnant of the original atom is referred to as an 
ion. If still more energy is supplied, the ion may lose another electron and become 
doubly ionized. 


When an atom is ionized, the combined system comprising the ion and a detached 
electron can exist at any energy above that at which the electron is ejected. The 
system is said to be ‘in the continuum’, a term drawing attention to the fact that,
unlike the discrete energies for which the electron is bound, a continuous range of 
energies is now possible. Figure 1.3 shows the continuum of states that exists 
above 13.6 eV for an ionized hydrogen atom. 


● Which element has atoms that cannot be doubly ionized?


○ A hydrogen atom cannot be doubly ionized since it has only one electron.


● A helium atom has atomic number Z = 2. An alpha particle is a
doubly-ionized atom of helium. How many electrons does it have?


○ A neutral helium atom has two electrons, so a doubly-ionized helium atom
(an alpha particle) has no electrons.


1.1.2 Democritus had true insight! 


The fact that atoms and molecules have particular energy levels is of exceptional 
scientific importance, and not just because the differences between energy
levels provide the foundation for spectroscopy. It explains a deep mystery that
sometimes goes unnoticed. Why is any element or compound always exactly the 
same, no matter what processes it has undergone? You can boil water and 
condense it, freeze it and melt it and it is still the same old water. It was the 
attempt to understand such behaviour that led the ancient Greek thinkers 
Leucippus (480-420 BC) and Democritus (460-370 BC) to put forward the idea of 
indestructible infinitely-hard atoms, with no internal structure. 


We now know that atoms do have internal structure and that this accounts for their 
spectra. But how can atoms with internal structure be virtually indestructible? 
Only since the advent of quantum mechanics have we been able to resolve this 
issue. The key point is that all atoms of a given kind have identical ground states. 
Bump them, jostle them hard enough, shine light on them, and they might jump 
into an excited state of higher energy, but they will eventually lose energy
and return to the ground state. Jostle them even harder or bombard them with
electrons, as in an arc lamp or on the surface of the Sun, and some of their 
electrons may get knocked out: an atom may become ionized. But an ionized 
atom is a charged atom, and charged atoms will pick up electrons, given a chance, 
so the atom will reassemble and end up back in its ground state again. Unless they 
undergo nuclear processes, atoms are effectively indestructible and their ground 
states are always the same. These are ideas of enormous explanatory power. They
explain why atoms have survived for billions of years with unchanged spectral
signatures. None of this was intelligible before the advent of quantum theory. 


1.2 A farewell to determinism 


A deeply-held belief of physicists at the beginning of the twentieth century was 
that the same effect inevitably follows a given cause; that precisely-defined 
circumstances always lead to the same precisely-defined consequences. For 
example, if the exact positions and velocities of colliding snooker balls were 
known, physicists were confident that they could predict exactly where the balls 
would go. In this sense, the world was believed to be deterministic. But quantum 
physics changes all that: some things in Nature are governed by chance. Classical 
physics is deterministic even when it might appear not to be; it is the sheer 
complexity of a sample of gas, for example, that leads to apparent randomness. 
The quantum world, however, is irreducibly indeterministic and some things 
cannot be predicted, even in principle, with the largest computer imaginable. In 
this course we shall see how quantum mechanics can predict much about atoms, 
but some things, such as when a particular radioactive nucleus will decay, remain 
a matter of pure chance. 


1.2.1 The exponential law of radioactive decay 


Our first example of quantum indeterminism is the exponential law of radioactive
decay, discovered by Ernest Rutherford in 1900, before quantum mechanics was
established. It was only many years later that the significance of this discovery
sank in. What makes it remarkable is the fact that, as we now understand, nuclei
can be absolutely identical. We cannot have absolutely identical copies of
macroscopic objects like white snooker balls. However, all atomic nuclei of a
specific kind in their ground states are absolutely identical. But though identical,
they do not decay at identical times; there is no hidden property telling a nucleus
when to decay.

The fact that atoms or nuclei of a given kind, as well as fundamental particles like
electrons, can be identical is a uniquely quantum phenomenon, without parallel in
the everyday world. It has profound significance for our understanding of matter,
as you will see later in the course.


Each nucleus of a specific kind (e.g. the uranium isotope $^{238}$U) has the same
probability of decaying in a short time interval $\delta t$. We write this probability as
$\lambda\,\delta t$, where $\lambda$ is called the decay constant of the given type of nucleus. The decay
constant has the same value for all nuclei of the given kind and is independent of
time. It describes the statistical behaviour of a large number of nuclei, but does
not tell us when an individual nucleus will decay. For example, $^{238}$U nuclei have a
very small decay constant, and typically survive for billions of years, but an
individual $^{238}$U nucleus could decay within the next second. Nuclei do not have
internal clocks telling them when to decay, and a nucleus that has been around for
a long time is no more likely to decay at a certain instant than one that has just
been created.


If at time $t$ there are $N(t)$ nuclei of a given kind then, in the small time interval
from $t$ to $t + \delta t$, the decrease in $N$ is the number of nuclei that decay in the given
interval. The average number of nuclei decaying in the given interval is equal to
the number $N(t)$ of nuclei that were present at the start, multiplied by the
probability $\lambda\,\delta t$ that any one of them will decay. Hence, on average, the number
of nuclei changes over the interval by

$$\delta N = -\lambda N\,\delta t. \qquad (1.3)$$

The minus sign shows that the number present, $N$, decreases as nuclei decay. The
$N$ on the right-hand side reflects the fact that the number decaying at some time $t$
is proportional to the number present. We stress that if $\lambda\,\delta t$ is the probability of a
nucleus decaying in time $\delta t$, then $N\lambda\,\delta t$ is the average number decaying in time
$\delta t$ when there are $N$ nuclei present, so this is a statistical law. From Equation 1.3
we see that, dividing by $\delta t$ and taking the limit of a small time interval, the
number $N(t)$ of the original kind of nucleus falls at a rate

$$\frac{dN}{dt} = -\lambda N(t). \qquad (1.4)$$

It is easy to confirm that Equation 1.4 is satisfied by the exponential function

$$N(t) = N(0)\,e^{-\lambda t}. \qquad (1.5)$$
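The statistical character of this law can be illustrated with a small simulation. In the Python sketch below, every surviving nucleus decays with probability λ δt in each time step, as in Equation 1.3, and the number left at the end is compared with the exponential law of Equation 1.5. The function name `simulate_decay` and all numerical values are assumptions chosen for illustration, and Python's pseudo-random generator is only a stand-in for genuine quantum chance.

```python
import math
import random

def simulate_decay(n0, lam, dt, steps, seed=0):
    """Step a population forward in time; each surviving nucleus
    decays with probability lam*dt during every interval dt."""
    rng = random.Random(seed)   # pseudo-random stand-in for quantum chance
    n = n0
    history = [n]
    for _ in range(steps):
        decays = sum(1 for _ in range(n) if rng.random() < lam * dt)
        n -= decays
        history.append(n)
    return history

# Illustrative values only: 5000 nuclei, lam = 0.1 per unit time.
n0, lam, dt, steps = 5000, 0.1, 0.02, 500
history = simulate_decay(n0, lam, dt, steps)
t_end = steps * dt
expected = n0 * math.exp(-lam * t_end)   # Equation 1.5
print(f"simulated N = {history[-1]}, exponential law N = {expected:.0f}")
```

The simulated number fluctuates around the exponential prediction from run to run (change `seed` to see this), and the relative size of the fluctuations shrinks as the starting number of nuclei grows.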


The time taken for half the nuclei to decay, on average, is called the half-life,
$T_{1/2}$. From Equation 1.5 we have $1/2 = e^{-\lambda T_{1/2}}$, from which it follows that

$$T_{1/2} = \frac{\ln 2}{\lambda} = \frac{0.693}{\lambda}, \qquad (1.6)$$

and hence

$$N(t) = N(0)\,e^{-0.693\,t/T_{1/2}}. \qquad (1.7)$$

Values of $T_{1/2}$ cover a very wide range. For $^{232}$Th, $T_{1/2} = 1.4 \times 10^{10}$ years;
for $^{212}$Po, $T_{1/2} = 0.3 \times 10^{-6}$ s.


This equation says that, if there are $N(0)$ nuclei when we start the clock, then
when $t = T_{1/2}$, one-half will remain and when $t = 2T_{1/2}$, one-quarter will
remain, and so on. Since decay is a statistical process, these numbers of nuclei
remaining are true on the average, and are not accurate when $N$ is small, so it is
not true that, starting with one nucleus, half a nucleus will remain after time $T_{1/2}$.
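The relation between the decay constant and the half-life can be checked numerically. The Python sketch below evaluates the average fraction of nuclei remaining after whole numbers of half-lives. The function names are hypothetical, and the half-life used is the thorium-232 value quoted in the margin note; any other positive value would give the same fractions.

```python
import math

def decay_constant(half_life):
    """Equation 1.6: lambda = ln 2 / T_half (in inverse units of half_life)."""
    return math.log(2) / half_life

def fraction_remaining(t, half_life):
    """Equation 1.7: average fraction N(t)/N(0) of nuclei still present."""
    return math.exp(-decay_constant(half_life) * t)

T_HALF = 1.4e10   # years, the thorium-232 value quoted in the margin note
for n in range(4):
    frac = fraction_remaining(n * T_HALF, T_HALF)
    print(f"after {n} half-lives: fraction remaining = {frac:.4f}")
```

Each additional half-life multiplies the average fraction by one-half, whatever the value of the half-life itself.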


The exponential law of Equation 1.5 describes how Rutherford observed nuclei to
decay. At the time of its discovery, it would have been natural to suppose that this 
law is a consequence of some deterministic mechanism in nuclei. If nuclei 
behaved like complex classical systems, one would suppose that the decay time of 
any individual nucleus could be predicted if only we knew enough about its 
internal dynamics. In a complex classical system, such as the jar of numbered 
balls used in a lottery draw, apparently random behaviour emerges from very 
complicated dynamics. If we knew the exact positions and velocities of the balls 
and all the forces acting on them, we could dream of predicting the result of
the draw; only the complexity of the calculation would thwart us. But such
calculations are impossible for a nucleus, even in principle. There are no hidden
parameters to distinguish individual nuclei and allow their decay times to be
predicted. This is why radioactive decay is so startling. All $^{238}$U nuclei that are in
their ground state, for example, are identical even though they will not decay at 
identical times. This is utterly beyond explanation by classical physics. 


The emission of photons by atoms is also inherently indeterministic; the half-life, 
$T_{1/2}$, of an atomic state as it waits to decay to a state of lower energy is typically a
few nanoseconds. The decay from state to state by photon emission follows the 
same general rule for all the systems shown in Figure 1.3, although the half-lives 
vary widely. All hydrogen atoms in the same excited state are identical structures
of an electron and a proton, but they decay at random. No internal clocks tell
them when to decay. In principle, events in the macroscopic world, such as the 
switching on of a light, could be triggered by random nuclear or atomic processes: 
in this sense the world is fundamentally probabilistic. Nevertheless, there are 
many entirely predictable things, such as the fact that the half-lives measured for 
any two large collections of identical nuclei will always be the same wherever and 
whenever they are measured, that copper will always be reddish in colour and the 
energy levels of hydrogen will always be the same. This interplay between the 
absolutely predictable and the absolutely unpredictable is a characteristic feature 
of quantum physics. We shall see several examples in this chapter. 


Exercise 1.3 The Cassini spacecraft sent to Saturn was powered by energy 
released by the radioactive plutonium isotope, $^{238}$Pu, which has a half-life of
87.7 years. What fraction of the power produced when Cassini was launched
would still be produced on arrival at Saturn after 7.30 years?


1.2.2 An application of quantum indeterminism 


Quantum indeterminism is not confined to decaying atoms and nuclei, but 
pervades the microscopic world. Practical devices that make essential use of 
quantum chance include quantum random number generators, which have been
developed and built by physicists in Vienna, Geneva and elsewhere. 


Random numbers are very important in modern science and engineering and are 
required for a wide range of computational procedures and simulations, for 
cryptography, and even for slot machines and lottery games. Some of these tasks 
are very demanding, needing very rapid generation of numbers, and requiring 
‘true’ random numbers rather than the so-called pseudo-random numbers 
generated by computer algorithms; present-day computers are deterministic and 
cannot provide truly random numbers. A quantum random number generator 
provides a rapid source of numbers that are guaranteed to be truly random. 


The device is based on a half-silvered mirror, a mirror
coated in such a way that if many photons fall on any part
of it, at an angle of incidence of 45°, half are transmitted
and half are reflected. Such a mirror is an example of a
beam splitter. In Figure 1.4 photons of light fall one-by-one
upon a half-silvered mirror. Half of these photons pass
straight through and half are reflected through 90°. That
simple statement conceals a remarkable quantum subtlety.
Each photon in some sense actually goes both straight through
and is reflected at 90°, but is only detected at one of the
two detectors, a remarkable and counter-intuitive quantum
feature that we shall meet many times in this course.

Figure 1.4 Photons from a source emitting one photon at a time fall on a
half-silvered mirror and are detected with equal probabilities at detector ‘1’
and detector ‘0’.

Which of the two detectors fires when a photon is incident
on the mirror is completely random, and we can no more say
which it will be than we can say when a specific radioactive
nucleus will decay. But just as there is a statistical law that
describes the decay of a nucleus in terms of its decay constant
$\lambda$, so we can say that, at least in the ideal case, 50% of the
incident photons will be detected by detector ‘1’ and 50% by detector ‘0’. The
probability of a photon being detected by detector ‘0’ is 0.5, and the probability of
it being detected by detector ‘1’ is also 0.5. The sum of these probabilities is
equal to unity, since the photon is certain to be detected at either ‘0’ or ‘1’. This
behaviour is very different from that of a classical electromagnetic field: if an
electromagnetic wave is incident on a half-silvered mirror, then, according to
classical physics, there would be a field with half the incident intensity at each
detector no matter how weak the field. But a photon must be detected in one
detector or the other; in this respect photons behave like particles.

Figure 1.5 Four quantum random number generators ready to slide into a spare
slot of a PC. Produced by id Quantique of Geneva.

Figure 1.6 Thomas Young (1773-1829), who established the wave nature of light.


The two detectors in Figure 1.4 are labelled ‘0’ and ‘1’ for a good reason: photon 
counts in these detectors are interpreted as binary digits 0 and 1 respectively. It is 
not difficult to extract true random numbers from the random sequence of Os and 
1s. A commercial random number generator based on a half-silvered mirror was 

marketed late in 2004, and is shown in Figure 1.5. 
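One simple way of turning a sequence of detector clicks into numbers is to pack successive bits into 8-bit integers. The Python sketch below illustrates this idea under stated assumptions: it is not the method used in the commercial device, the function names are hypothetical, and Python's pseudo-random generator stands in for the half-silvered mirror, so the output of the sketch itself is not truly random.

```python
import random

def detector_clicks(n_photons, seed=None):
    """Simulate which detector (0 or 1) fires for each incident photon.
    A pseudo-random generator stands in for the real beam splitter."""
    rng = random.Random(seed)
    return [1 if rng.random() < 0.5 else 0 for _ in range(n_photons)]

def bits_to_bytes(bits):
    """Pack consecutive groups of 8 bits into integers from 0 to 255.
    Any leftover bits at the end (fewer than 8) are discarded."""
    usable = len(bits) - len(bits) % 8
    out = []
    for i in range(0, usable, 8):
        value = 0
        for bit in bits[i:i + 8]:
            value = (value << 1) | bit   # shift left, append the new bit
        out.append(value)
    return out

clicks = detector_clicks(8000, seed=1)
numbers = bits_to_bytes(clicks)
print(f"fraction of '1' clicks: {sum(clicks) / len(clicks):.3f}")
print(f"first few numbers: {numbers[:5]}")
```

In a real device the list `clicks` would come from the two photon detectors rather than from `random.Random`; the packing step would be unchanged.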
1.3 Light as waves – light as particles


The photon concept was a radical departure from the wave picture that had 
become a cornerstone of physics. Although Newton put forward cogent reasons 
for regarding light as a stream of corpuscles, it was the wave theory developed 
by Huygens and others that became widely accepted at the beginning of the 
nineteenth century when optical interference and diffraction were demonstrated 
by Thomas Young (Figure 1.6) and his successors. 


Interference describes the effects observed when two or more waves are 
superposed, leading to a reinforcement in the disturbance in some places and a 
cancellation in others. Diffraction describes the spreading of a wave that occurs 
when it passes through an aperture or around an obstacle. The two phenomena are
linked because the details of diffraction can be explained by the interference of
many waves taking different routes. Interference and diffraction are characterized 
by patterns of intensity which can only be understood in terms of waves. Other 
experiments showed that light can be polarized, implying that its waves are 
transverse, rather than longitudinal. In other words, whatever vibrates must 
vibrate perpendicular to the direction of propagation. This picture was completed
in the 1860s, when Maxwell showed that transverse electromagnetic waves,
with exactly the speed of light, are a consequence of electromagnetic theory,
as summarized by Maxwell’s equations. The existence of these waves was
experimentally verified by Hertz in 1887.


Einstein’s radical proposal of 1905 that electromagnetic radiation comes in 
packets, later known as photons, appeared deeply inconsistent with Maxwell’s 
theory and can still seem unsettling: does light consist of particles or waves? We 
now understand light to be neither just waves nor just a stream of particles; it 
behaves like nothing else but itself. Some experiments reveal wave-like properties, 
some reveal particle-like properties. We say that light exhibits wave–particle
duality. This is not an embarrassed avoidance of the question: ‘is light particles 
or waves?’ When we think about it, there is no reason why light should behave 
like water waves or like bullets; many things in the quantum world have no 
correspondence with anything in the everyday world. Why, after all, should they? 


Describing waves 


RV 


Figure |.7 A sinusoidal plane wave moving along the z-axis. At a given 
time, the wavelength A is the distance between two successive points at 
which the wave is at the same stage of its cycle. At any given position, the 
period T is the time taken for the wave to complete one cycle. 


It may be helpful to review the definitions needed to describe waves moving 
in one dimension (see Figure 1.7). The expression 


u(x,t) = Acos(27(a/A — t/T)) = Acos(kax — wt) (1.8) 


represents a sinusoidal plane wave moving along the x-axis with
wavelength $\lambda$ and frequency $f = 1/T$, where T is the period. We mostly
use the second form of the equation, involving the wave number $k = 2\pi/\lambda$
and the angular frequency $\omega = 2\pi f$. The positive constant A is called the
amplitude of the wave.


The wave 


$$u(x,t) = A\cos(kx - \omega t + \phi) \tag{1.9}$$


has an additional phase constant $\phi$. The argument of the cosine,
$kx - \omega t + \phi$, is called the phase of the wave; this is not to be confused with
the phase constant, which is equal to the phase at x = 0 and t = 0. Phases
will be important when we discuss interference effects.


The speed of the wave in Equation 1.8 or 1.9 is given by $v = f\lambda = \omega/k$. For
light in free space (not in a medium like glass) this is the speed of light in a 
vacuum, c. 
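
As a quick numerical check of these definitions, the following Python sketch evaluates the wave of Equation 1.9 and confirms that $f\lambda$ and $\omega/k$ give the same speed. The function name `wave` and the numerical values are illustrative assumptions, not part of the course text.

```python
import math

def wave(x, t, amplitude, wavelength, period, phase_constant=0.0):
    """Sinusoidal plane wave u(x, t) = A cos(kx - wt + phi), as in Equation 1.9."""
    k = 2.0 * math.pi / wavelength      # wave number
    omega = 2.0 * math.pi / period      # angular frequency
    return amplitude * math.cos(k * x - omega * t + phase_constant)

# Illustrative values in arbitrary units (assumptions for this sketch).
wavelength = 2.0
period = 0.5
frequency = 1.0 / period
k = 2.0 * math.pi / wavelength
omega = 2.0 * math.pi * frequency

speed_from_f_lambda = frequency * wavelength
speed_from_omega_k = omega / k

# A point that moves with the wave speed stays at the same phase.
dt = 0.137
u_before = wave(0.3, 0.0, 1.0, wavelength, period, 0.4)
u_after = wave(0.3 + speed_from_f_lambda * dt, dt, 1.0, wavelength, period, 0.4)

print(speed_from_f_lambda, speed_from_omega_k)
print(u_before, u_after)
```

The last two printed values agree because the phase $kx - \omega t + \phi$ is unchanged when x increases by $v\,\delta t$ while t increases by $\delta t$.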


1.3.1 Light behaving as waves 


Figure 1.8 shows examples of the patterns made by light passing through 
apertures and falling upon a screen. (The screen is really much more distant 
from the apertures than can be shown in the diagram.) Figure 1.8a shows the 
diffraction pattern produced when light passes through a single narrow slit S1.
The overall size of the diffraction pattern depends on the size of the aperture: in 
Figure 1.8b the pattern broadens out when we use a narrower slit S2. Light of a 
longer wavelength also leads to a broader pattern but this is not shown here. The 
wave nature of light was not established until around 1800, partly because the 
wavelength of light is very small compared to the size of everyday objects. 


The pattern in Figure 1.8c made Young famous. It occurs when two very narrow
slits S3 and S4 are spaced so that their broad diffraction patterns become
superimposed and interference takes place. The characteristic feature of this
two-slit interference pattern is the series of dark interference minima,
separated by bright interference maxima. The dark regions would have been
bright with either slit alone. The wave picture explains these as the places where
the disturbances caused by the wave that originates from S3 cancel, or nearly
cancel, the disturbances caused by the wave originating from S4. The details of the
two-slit pattern depend on the width and separation of the slits; widely-spaced
narrow slits can lead to a regular sequence of light and dark bands.


1.3.2 Light behaving as particles 


In Section 1.1 we referred to photons as ‘packets of electromagnetic radiation’. 
These are the original ‘light quanta’ of energy hf introduced by Einstein in a 
famous paper of 1905 in which, among other things, he gave an explanation of the 
photoelectric effect. The mature concept of a photon with particle-like properties 
required some 20 more years to develop. Einstein was perplexed that, although
electromagnetic theory says that an atom should radiate in all directions like
a radio antenna, atoms seemed to radiate in a single direction as if emitting
particles. In an important paper in 1916, in which he tried to reconcile these ideas 
(and in which he uncovered the underlying principle of lasers) he referred to 
‘radiation bundles’. When in 1923 Compton showed that these radiation bundles 
had an associated momentum vector and transferred momentum to electrons (the 
Compton effect), Einstein immediately recommended him for the Nobel prize in 
physics. We now understand that a photon of frequency f carries momentum of 
magnitude


$$p_{\text{photon}} = \frac{hf}{c} = \frac{h}{\lambda}, \tag{1.10}$$


where $\lambda$ is the wavelength. Photons are detected as localized particles when a
single grain darkens on a photographic film, or a Geiger counter clicks in response 
to the passage of a gamma-ray photon. 
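
To get a feel for the size of the momentum in Equation 1.10, the short Python sketch below evaluates it in both forms for a photon of visible light. The 500 nm wavelength and the function names are assumptions chosen for illustration.

```python
# Photon momentum from Equation 1.10: p = h f / c = h / wavelength.
H = 6.62607015e-34      # Planck's constant in J s (exact SI value)
C = 299_792_458.0       # speed of light in a vacuum in m/s (exact SI value)

def photon_momentum_from_frequency(frequency_hz):
    """Momentum magnitude in kg m/s from the frequency form h f / c."""
    return H * frequency_hz / C

def photon_momentum_from_wavelength(wavelength_m):
    """Momentum magnitude in kg m/s from the wavelength form h / lambda."""
    return H / wavelength_m

wavelength = 500e-9                 # illustrative green light (assumption)
frequency = C / wavelength          # f = c / lambda in free space

p_from_f = photon_momentum_from_frequency(frequency)
p_from_lambda = photon_momentum_from_wavelength(wavelength)

print(f"p from frequency:  {p_from_f:.4e} kg m/s")
print(f"p from wavelength: {p_from_lambda:.4e} kg m/s")
```

Both forms give a momentum of order $10^{-27}$ kg m/s, which shows why the momentum of individual photons goes unnoticed in everyday life.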




Figure 1.8 (a) A narrow slit, S1, in an absorbing screen generates a diffraction
pattern on a distant detecting screen (not shown to scale). The intensity I of the
light at each point on the screen is indicated by a coloured curve. Bright regions
are marked B, dark D. The slit extends perpendicular to the plane of the page, as
does the pattern on the detecting screen. (b) When a narrower slit S2 is used, the
diffraction pattern becomes more spread out. (c) When there are two very narrow
identical slits, S3 and S4, separated by a distance much greater than the width of
each slit, a characteristic two-slit interference pattern appears. In reality, the slits
would be narrow compared to $\lambda$, the wavelength of the light.


Exercise 1.4 (a) What is the magnitude of the momentum carried by a single 
gamma-ray photon of energy 1 MeV? (b) Use the equivalence of force and rate 
of change of momentum to determine how many such gamma-ray photons per 
second, falling perpendicularly onto a block of lead and absorbed by it, would 
exert a force of 1 N. (c) How many AM radio photons per second would exert the 
same force? (You can use the result of Exercise 1.1.)


Figure 1.9 Louis de Broglie (1892-1987), who proposed that particles such as
electrons have wave-like properties. He received the Nobel prize for physics
in 1929.


1.3.3 Light behaving like waves and particles


In 1909 G. I. Taylor studied diffraction patterns using light of extremely low 
intensity: so low that a three-month exposure was required to form a diffraction 
pattern on a photographic film. Such low intensities mean that after a short time 
interval, just a few scattered spots appear on grains of the film, apparently at 
random. But after a long time the accumulated spots form the same diffraction 
pattern as a short exposure with high intensity light. 


There are two important features in what Taylor found: 


1. The pattern that eventually forms is that predicted for waves passing through 
the aperture but, when the waves reach the film, they register as just one spot 
at a time. The diffraction pattern corresponds to a set of waves with a 
particular wavelength. But it is as if the waves answer the question ‘Where 
are you?’ by darkening a single sensitive grain in the film, just as a gamma 
ray makes a single click in a Geiger counter. 


2. There are places in the diffraction pattern with very few spots: interference 
minima. Since the diffraction pattern depends on the size of the aperture, 
increasing the size of the aperture will reduce the density of spots at certain 
places (and also increase the density of spots where there were previously 
interference minima). 


From our modern perspective, we can also add: 


3. The regions with a low density of spots are not the result of different 
photons interfering with each other. This has been confirmed by modern 
experiments in which we can be sure that there is only one photon in the 
apparatus at any given time. 


4. The unpredictability of where the next spot appears contrasts with the 
predictable diffraction pattern that eventually emerges. This is analogous to 
the fact that we cannot predict when a particular radioactive nucleus will 
decay, but a collection of many nuclei does have a well-determined half-life. 


Taylor’s experiment is an excellent example of wave-particle duality in action:
the diffraction pattern is that predicted by a wave theory, but the light manifests 
itself at single points on the screen, like particles. 


Exercise 1.5 Look carefully at the first one of the four points listed above.
There is a close parallel with an aspect of what happens at the half-silvered mirror
in the random number generator. What is it?


1.4 Electrons as particles — electrons as waves 


Even before physicists assimilated the idea that light had a particle-like aspect, 
another bombshell exploded: it was proposed that particles, such as electrons, 
have wave-like properties. This idea was first put forward by Louis de Broglie in 
1924 (Figure 1.9). ‘But surely’, common sense tells us, ‘electrons are not waves! 
An electron has a definite electric charge. How can a wave have an electric 
charge? Electrons are indivisible, while waves simultaneously go both ways 
around an obstacle. Electrons leave spots on a phosphorescent screen like that in 
an old fashioned TV tube.’ 




Without doubt, electrons do commonly behave like particles: 
we can trace their tracks as they bend under the influence
of magnetic fields, as in Figure 1.10, and we can hear the
clicks made in a suitable Geiger counter by beta particles 
(electrons created and emitted by certain unstable nuclei) as 
they arrive one-by-one. But this does not mean that electrons 
behave only like particles. 


In fact, there is a symmetry in Nature: both photons and 
electrons are detected as localized particles, but travel through 
space as waves. The fact that electrons travel like waves was 
verified by diffraction experiments carried out by C. Davisson
and L. Germer in the USA and by G. P. Thomson in the
UK. The diffraction of electrons has important practical
applications, including electron microscopes. It has now been established 
experimentally that wave-like behaviour occurs for many kinds of particle: 
electrons, protons, nuclei and molecules, although experimental difficulties make 
verification difficult for objects as large as charging elephants. 


1.4.1 The de Broglie relationship 


The scale of a diffraction pattern depends on the wavelength of the wave. So, if 
we are to explain the diffraction of electron waves, we must understand what 
determines their wavelength. This issue was settled by Louis de Broglie. In 1924, 
de Broglie asserted that free electrons (that is, electrons that experience no forces) 
have a wavelength inversely proportional to the magnitude of their momentum, p. 
In fact, 


$$\lambda_{\text{dB}} = \frac{h}{p}, \tag{1.11}$$


where h is Planck’s constant. The wavelength $\lambda_{\text{dB}}$ is called the de Broglie
wavelength, and Equation 1.11 is called the de Broglie relationship. 


We now know that all particles have a de Broglie wavelength given by 
Equation 1.11. This includes photons, as you can see from Equation 1.10. 

The validity of the de Broglie relationship for the motion of nuclei (composite 
particles of many protons and neutrons) has been known for many years. More 
recently, Equation 1.11 has been verified for molecules as large as ‘bucky balls’ 
(C60) and even heavier molecules. 


The de Broglie relationship tells us that, the larger the momentum, the smaller the 
de Broglie wavelength. For example, the electrons in the beam of a transmission 
electron microscope have been accelerated to high values of momentum, and have 
very short de Broglie wavelengths, much smaller than the wavelength of light. 
This allows electron microscopes to resolve much finer detail than ordinary light 
microscopes. 
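
The following Python sketch puts numbers to Equation 1.11 for electrons accelerated from rest through a potential difference. The two voltages (54 V, typical of low-energy electron diffraction, and 100 kV, typical of a transmission electron microscope) are illustrative assumptions, as are the function names. The momentum is computed relativistically, which matters at the higher voltage.

```python
import math

H = 6.62607015e-34           # Planck's constant, J s
C = 299_792_458.0            # speed of light, m/s
E_CHARGE = 1.602176634e-19   # magnitude of the electron charge, C
M_E = 9.1093837015e-31       # electron mass, kg

def electron_momentum(accelerating_voltage):
    """Relativistic momentum (kg m/s) of an electron accelerated from rest."""
    kinetic = E_CHARGE * accelerating_voltage        # kinetic energy in joules
    rest_energy = M_E * C**2
    # (pc)^2 = E_kin^2 + 2 E_kin m c^2
    return math.sqrt(kinetic**2 + 2.0 * kinetic * rest_energy) / C

def de_broglie_wavelength(momentum):
    """Equation 1.11: lambda_dB = h / p."""
    return H / momentum

lam_low = de_broglie_wavelength(electron_momentum(54.0))       # illustrative
lam_high = de_broglie_wavelength(electron_momentum(100e3))     # illustrative

print(f"54 V:   {lam_low:.3e} m")
print(f"100 kV: {lam_high:.3e} m")
```

The first wavelength is comparable to the size of an atom, while the second is a few picometres, roughly five orders of magnitude shorter than the wavelength of visible light.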


The diffraction of electrons can be exploited to study the structure of matter on an
atomic scale and below. The rings in Figure 1.11 are formed when an electron
beam diffracts from the regular array of carbon atoms in graphite, and the spacing
of the carbon atoms can be deduced from the diameter of these rings. In a similar
way, the diffraction of very high-energy electrons by atomic nuclei is the main
source of information on nuclear sizes. Very high energies are required to ensure
that the de Broglie wavelengths of the electrons are small enough to resolve the
nuclear details.


Figure 1.10 Electrons moving in a magnetic field inside an evacuated glass bulb.
The bluish circular path is made visible by a small amount of gas in the glass bulb.


The subscript dB stands for de Broglie.


Figure 1.11 The diffraction of high-energy electrons by graphite, producing the
rings shown here, can be used to measure the spacing of the carbon atoms in the
graphite.


Do not confuse the de Broglie wavelength with the size of a particle. For particles 
of a given energy, the greater the mass, the greater the momentum, and the smaller 
the de Broglie wavelength. This means that the de Broglie wavelength can be 
much larger or much smaller than the particle itself. For example, the de Broglie 
wavelength of the electrons used by Davisson and Germer was about the size of 
an atom, although electrons are, as far as we can tell, point particles. By contrast, 
the de Broglie wavelength of a uranium nucleus that has been accelerated to 
1000 MeV is about one-hundredth the size of the uranium nucleus. Because of 
their large mass, even slowly-moving macroscopic objects have momenta that are 
enormous on an atomic scale, with correspondingly small de Broglie wavelengths. 
The fact that the de Broglie wavelengths of macroscopic bodies are very much 
smaller than the bodies themselves is an important reason why quantum effects 
are not apparent for everyday objects. 
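
The dependence of the de Broglie wavelength on mass can be illustrated with a short Python sketch. It compares an electron and a proton of the same kinetic energy, and then a slowly moving macroscopic grain. The chosen energy (1 MeV), the grain's mass and speed, and the function name are all assumptions made for illustration.

```python
import math

H = 6.62607015e-34              # Planck's constant, J s
C = 299_792_458.0               # speed of light, m/s
MEV = 1.602176634e-13           # one MeV expressed in joules
M_ELECTRON = 9.1093837015e-31   # kg
M_PROTON = 1.67262192369e-27    # kg

def wavelength_for_kinetic_energy(mass_kg, kinetic_j):
    """de Broglie wavelength h/p, with p from the relativistic energy relation."""
    rest_energy = mass_kg * C**2
    pc = math.sqrt(kinetic_j**2 + 2.0 * kinetic_j * rest_energy)
    return H * C / pc

# Same kinetic energy, different masses.
lam_electron = wavelength_for_kinetic_energy(M_ELECTRON, 1.0 * MEV)
lam_proton = wavelength_for_kinetic_energy(M_PROTON, 1.0 * MEV)

# A macroscopic object: a 1 mg grain drifting at 1 mm/s (illustrative values).
grain_momentum = 1.0e-6 * 1.0e-3          # kg m/s, non-relativistic p = m v
lam_grain = H / grain_momentum

print(f"electron at 1 MeV: {lam_electron:.3e} m")
print(f"proton at 1 MeV:   {lam_proton:.3e} m")
print(f"1 mg grain:        {lam_grain:.3e} m")
```

The grain's wavelength is many orders of magnitude smaller than an atomic nucleus, let alone the grain itself.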


1.4.2 Diffraction and the uncertainty principle 


A further detail of electron diffraction is that the diffraction pattern becomes 
broader as the aperture becomes narrower. This is exactly the same as for light 
(compare Figures 1.8a and b). It is interesting to note that this feature can be 
interpreted using the uncertainty principle, a fundamental rule of quantum 
mechanics which will be discussed in Chapter 4. In loose terms, the uncertainty 
principle tells us that it is impossible to know both the position and momentum of 
a particle with perfect accuracy. In fact, the more we know about the position of a 
particle, the less we can know about its momentum. Consider an electron wave 
immediately after it has passed through a narrow slit. At this instant, the electron 
is somewhere within the width of the slit, so its coordinate in the direction 
perpendicular to both the slit and to the direction of motion of the beam is known 
quite accurately. If we make the slit narrower, this coordinate will be known more 
accurately, and the uncertainty principle tells us that the corresponding component 
of momentum is known less accurately. The greater uncertainty in transverse 
momentum means that there is a greater uncertainty in where the electron will 
arrive on the screen, and this explains why the diffraction pattern is broader. 
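
A rough numerical sketch of this argument is given below in Python. It assumes two things that are not stated in the text above: the standard single-slit result that the first diffraction minimum lies at $\sin\theta = \lambda/a$ for a slit of width a, and an order-of-magnitude reading of the uncertainty principle in which the transverse momentum uncertainty is about h/a. The wavelength and slit widths are illustrative.

```python
import math

def first_minimum_angle(wavelength, slit_width):
    """Angle of the first single-slit minimum, sin(theta) = wavelength / width."""
    if slit_width <= wavelength:
        raise ValueError("this estimate assumes a slit wider than the wavelength")
    return math.asin(wavelength / slit_width)

def uncertainty_angle(wavelength, slit_width):
    """Angular spread from a transverse momentum uncertainty of about h / width.

    With p = h / wavelength, the ratio (delta p) / p is wavelength / width.
    """
    return math.atan(wavelength / slit_width)

wavelength = 5.0e-12                    # illustrative de Broglie wavelength, m
widths = [1.0e-6, 1.0e-7, 1.0e-8]       # progressively narrower slits, m

angles = [first_minimum_angle(wavelength, a) for a in widths]
estimates = [uncertainty_angle(wavelength, a) for a in widths]

for a, theta, est in zip(widths, angles, estimates):
    print(f"width {a:.0e} m: diffraction {theta:.2e} rad, estimate {est:.2e} rad")
```

Both angles grow as the slit narrows, and they agree in order of magnitude, which is all that such a loose argument can be expected to deliver.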


1.4.3 Electrons behaving like waves and particles 


Wave-particle duality for photons was demonstrated by Taylor’s experiment, 
which produced a diffraction pattern (characteristic of waves) formed from a 
series of spots (characteristic of particles). Something very similar occurs with 
electrons and other material particles. 


Figure 1.12 shows an experimental arrangement that is analogous to the two-slit 
interference experiment for light. A beam of electrons is incident on an 
arrangement called a biprism, which mimics the effect of two slits. The electrons 
arriving at a particular point on the detector can pass either side of a filament, 
which is analogous to photons passing through either of two slits in a screen. 
Figure 1.12 Schematic representation of an experimental arrangement used by
Tonomura in 1989 to demonstrate the wave-particle duality of electrons. The two
paths of the electrons around the filament are analogous to the paths through
two slits in an otherwise absorbing screen. (Diagram labels: source of
electrons, electron biprism, filament, screen, detector.)


Figure 1.13 shows how single-electron events build up over a 20-minute exposure
to form an interference pattern. In (a) just 7 electrons have appeared on the screen
and it is impossible to discern a pattern. It is still hard to see a pattern with
270 electrons in (b) though it is discernible in (c) where 2000 electrons have been
detected. The pattern is quite clear after a much longer exposure involving 60 000
electrons shown in (d). Here, then, is wave-particle duality for electrons. The
electrons appear as indivisible particles, arriving all-or-nothing at points on the
detecting screen but, in the long run, the distribution of spots on the screen
follows the interference pattern expected for waves.


Figure 1.13 The step-by-step build-up of a two-slit interference pattern, each
spot being made by a single electron. Patterns formed with: (a) 7 electrons;
(b) 270 electrons; (c) 2000 electrons and (d) 60 000 electrons.


A single electron is not predestined to arrive at a particular point on the screen;
until a spot irreversibly appears in just one pixel, the electron could have appeared
at any point where the electron wave was non-zero. One can send any number of
electrons towards a pair of slits, and they will not end up at identical points on the
screen any more than identical uranium nuclei undergo radioactive decay at the
same time. They will form a well-defined pattern, just as identical nuclei have a
well-defined decay constant, $\lambda$.


In summary: an electron passing through two slits provides a good example of the 
interplay between determinism and indeterminism and, at the same time, the 
interplay between wave and particle behaviour. It is typical of the quantum world 
that we cannot say where an individual electron will be detected, but we can 
predict the pattern produced by many electrons. 
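
The build-up of a pattern from unpredictable single events can be imitated with a small Python simulation. This is only a sketch under simplifying assumptions: the probability density is taken to be an idealized $\cos^2$ fringe pattern with unit fringe spacing (ignoring the single-slit envelope), and all names are hypothetical. Each simulated electron lands at a random position drawn from that density.

```python
import math
import random

def fringe_density(x):
    """Idealized, un-normalized two-slit density with fringe spacing 1."""
    return math.cos(math.pi * x) ** 2

def sample_positions(n, rng, half_width=3.0):
    """Draw n arrival positions by rejection sampling from fringe_density."""
    positions = []
    while len(positions) < n:
        x = rng.uniform(-half_width, half_width)
        if rng.random() < fringe_density(x):
            positions.append(x)
    return positions

def histogram(positions, bins=24, half_width=3.0):
    """Count arrivals in equal-width bins across the screen."""
    counts = [0] * bins
    for x in positions:
        index = int((x + half_width) / (2.0 * half_width) * bins)
        counts[min(index, bins - 1)] += 1
    return counts

rng = random.Random(1)          # fixed seed so that the run is repeatable
patterns = {}
for n in (7, 270, 2000, 60000):
    patterns[n] = histogram(sample_positions(n, rng))
    peak = max(patterns[n])
    print(f"{n} electrons")
    for count in patterns[n]:
        print("#" * round(40 * count / peak))
```

With 7 electrons the bars look random; with 60 000 the alternation of densely and sparsely populated bins is unmistakable, even though no individual arrival position could have been predicted.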


1.4.4 Describing electron waves: the wave function 


If electrons behave like waves, there must be a mathematical function that
describes the value of the wave at each point in space and each instant in time.
This function is called the wave function, and it is denoted by the symbol $\Psi$. For
a single electron, the wave function depends on the position vector $\mathbf{r} = (x, y, z)$
and time, and so takes the form $\Psi(\mathbf{r}, t)$. However, we are often interested in
situations that can be treated in one spatial dimension, allowing the wave function
to be expressed as $\Psi(x, t)$.


$\Psi$ is the upper case Greek letter psi, pronounced ‘sigh’.


As time passes, the electron waves travel through space, much as sound waves
travel in a concert hall. The equation that governs how the wave of a single
electron, or any microscopic system, changes in time is known as Schrödinger’s
equation. It is explained in Chapter 2. Here, we simply note that Schrödinger’s
equation takes the wave function $\Psi(\mathbf{r}, t_1)$, describing the electron for all $\mathbf{r}$ at
time $t_1$, and carries it forward to give $\Psi(\mathbf{r}, t_2)$, describing the electron for all $\mathbf{r}$ at
time $t_2$. Schrödinger’s equation tells us how an electron wave spreads out as it
passes around a biprism and on to a detecting screen; this allows us to calculate
the final interference pattern.


Leaving aside Schrödinger’s equation for the moment, we still have the major
task of interpreting the wave function. Because the wave function is defined
throughout a region of space, it can be thought of as a field, but is unlike any
classical field you have met. An electric field, for example, has values that are real
vector quantities. It is easy to measure the electric field at any point by measuring
the force exerted on an electric charge placed at that point. By contrast, the wave
function $\Psi$ is a complex quantity, so it has both real and imaginary parts. Partly
for this reason, the wave function cannot be measured directly. Nevertheless, we
can use $\Psi$ to calculate the probability of finding the electron in any small region at
a particular time. That is clearly appropriate in situations where electrons are
diffracted, since the place where a particular electron will register cannot be
predicted, but we can predict the diffraction pattern formed by many electrons,
and the maxima in that pattern are the places where the electron is most likely to
be found. The precise statement of how $\Psi(x, t)$ is used to predict probabilities is
given by Born’s rule, named after Max Born (1882-1970), who belatedly
received the Nobel prize for physics in 1954.
Born’s rule


The probability of finding the particle at time t in a small volume element
$\delta V$, centred on position $\mathbf{r}$, is


$$\text{probability} = |\Psi(\mathbf{r}, t)|^2 \, \delta V. \tag{1.12}$$


$|\Psi|$ is the modulus of the complex quantity $\Psi$.


For systems that are modelled in one dimension, the probability of finding
the particle at time t in a small interval $\delta x$, centred on position x, is


$$\text{probability} = |\Psi(x, t)|^2 \, \delta x. \tag{1.13}$$


Note that the probability is proportional to the real quantity $|\Psi|^2$, and is not
given by $\Psi$ or $\Psi^2$. This is because $\Psi$ has complex values, and the probability
must be a real number.


For the specific case of an electron diffracted by a slit and striking a screen with
sensitive pixels, Born’s rule says: the probability of the electron being detected in
a given pixel is proportional to the size of the pixel, and to the value of $|\Psi|^2$ at the
position of the pixel. We cannot predict where a particular electron will appear,
but the pattern formed by many electrons is predicted by Schrödinger’s equation,
which governs the way $\Psi$ spreads out from the slit to the screen.
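
Born’s rule in one dimension (Equation 1.13) can be tried out numerically. The Python sketch below uses a Gaussian wave packet at a single instant as a stand-in wave function; this particular form, its width and its wave number are assumptions made purely for illustration, and the function names are hypothetical.

```python
import cmath
import math

def gaussian_packet(x, sigma=1.0, k=5.0):
    """Illustrative complex wave function at one instant (an assumed form)."""
    norm = (2.0 * math.pi * sigma**2) ** -0.25
    return norm * math.exp(-x**2 / (4.0 * sigma**2)) * cmath.exp(1j * k * x)

def detection_probability(psi, x, dx):
    """Equation 1.13: probability of finding the particle in dx centred on x."""
    return abs(psi(x)) ** 2 * dx

dx = 0.01
p_centre = detection_probability(gaussian_packet, 0.0, dx)
p_tail = detection_probability(gaussian_packet, 2.0, dx)

# Adding up the probabilities for all the small intervals should give 1.
total = sum(
    detection_probability(gaussian_packet, -10.0 + (i + 0.5) * dx, dx)
    for i in range(2000)
)

print(f"probability near x = 0: {p_centre:.6f}")
print(f"probability near x = 2: {p_tail:.6f}")
print(f"total probability:      {total:.6f}")
```

Note that the complex factor $e^{ikx}$ has no effect on the probabilities, because only the modulus of the wave function enters Born’s rule.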


Because the wave function is complex, you will need to manipulate complex 
numbers throughout your study of this course. It is worth refreshing your 
memory of them by reading Section 8.1 of the Mathematical toolkit now. 
You may wish to spend a significant amount of time on this, as it will be a 
good investment for future chapters. 


The rules given in the Mathematical toolkit for complex numbers apply equally 
well to functions with complex values. Here are the most important points to 
remember. 


1. Any complex function, f(x), can be expressed as a sum of a real part u(x) and
an imaginary part v(x):


$$f(x) = u(x) + i\,v(x), \tag{1.14}$$

where u(x) and v(x) are real-valued functions.


2. The complex conjugate of f(x) is written as $f^*(x)$ and given by

$$f^*(x) = u(x) - i\,v(x). \tag{1.15}$$


3. The modulus, or magnitude, of f(x) is written as |f(x)|. Dropping the
argument x for simplicity, this can be calculated as


$$|f|^2 = f^* f = (u - iv)(u + iv) = u^2 + v^2. \tag{1.16}$$

4. Any complex function, f, can also be expressed in terms of its modulus |f|
and phase $\phi$:

$$f = |f|\,e^{i\phi}. \tag{1.17}$$

Since

$$e^{i\phi} = \cos\phi + i\sin\phi,$$

the real and imaginary parts of f are, respectively, $\mathrm{Re}(f) = |f|\cos\phi$ and
$\mathrm{Im}(f) = |f|\sin\phi$, which checks out since


$$|f|^2 = (|f|\cos\phi)^2 + (|f|\sin\phi)^2 = |f|^2.$$


This will often appear in this course in the form:

$$|f|^2 = f^* f = |f|e^{-i\phi} \times |f|e^{i\phi} = |f|^2 e^{0} = |f|^2.$$


Figure 1.14 The upper graph shows $|\Psi(x, t)|^2$ for an electron at an instant of
time and the lower graph shows it at a later instant. Regions A, B and C are
referred to in the text.
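
These rules are easy to check with Python’s built-in complex numbers. In the sketch below the value of f is an arbitrary choice made for illustration, and the variable names are hypothetical.

```python
import cmath

f = 3.0 - 4.0j                    # arbitrary complex value (assumption)

u, v = f.real, f.imag             # real and imaginary parts (Equation 1.14)
f_conj = f.conjugate()            # complex conjugate (Equation 1.15)

mod_sq_from_parts = u * u + v * v         # u^2 + v^2
mod_sq_from_conj = (f_conj * f).real      # f* f (Equation 1.16)

modulus = abs(f)
phase = cmath.phase(f)                    # phase in radians
rebuilt = modulus * cmath.exp(1j * phase) # |f| e^{i phi} (Equation 1.17)

print(mod_sq_from_parts, mod_sq_from_conj, modulus**2)
print(f, rebuilt)
```

The product of f with its conjugate has zero imaginary part, which is why the squared modulus can serve as a probability density.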

Exercise 1.6 Figure 1.14 shows $|\Psi(x, t)|^2$ for an electron at two closely
spaced instants in time, the lower graph representing the later time. Three regions
along the x-axis, A, B and C, are marked. In which of these regions in the upper
graph would the electron be most likely to be found, least likely to be found, and
somewhere in between? Remembering that the lower graph represents a later
instant in time, would you say that the electron is moving in the direction of
increasing or decreasing x?


Exercise 1.7 In the last exercise, we concluded that the electron was moving
in the negative x-direction. Does this mean that the electron would inevitably be
found at a smaller value of x at the later time?


1.4.5 Particles and the collapse of the wave function 


What does it really mean to say that an electron, or any other entity, is a particle? 
Here is a reasonable definition, but one with far reaching consequences in the 
context of quantum mechanics. 


In quantum mechanics, a particle is an entity which is found in only one 
place when its position is measured. 


The point is illustrated by the phenomenon of electron diffraction. When a single
electron leaves a slit, there is a certain volume within which the wave function
$\Psi(\mathbf{r}, t)$ is non-zero, and outside which the wave function is practically zero. The
electron may be found anywhere in the volume where $|\Psi|^2 > 0$. The motion
of this volume follows from Schrödinger’s equation, which governs the time
development of $\Psi(\mathbf{r}, t)$. By the time the electron wave reaches the screen, it is
spread out over a wide area.


The screen is covered with pixels (arrangements of atoms that can record the 
arrival of an electron). The arrival of the electron wave at the screen is marked by 
a permanent change in one of these pixels. Crucially, only one of the pixels 
responds and the others remain unchanged. It is as if the electron is asked by the 
screen where it wishes to appear. The electron responds by triggering a reaction in 
one of the pixels, and automatically avoids reacting with any other. If we try to 
interpret this in terms of news spreading out from the responding pixel, telling 
other pixels not to respond, the news must spread out instantly. This is an example 
of quantum non-locality, in which what happens at one place affects what happens
at other places in a way that cannot be explained by communication at the speed
of light, the maximum speed at which any information can be transmitted. The 
detection of the particle at one and only one location implies that the wave 
function has somehow ‘collapsed’ onto a single pixel in a sudden and abrupt 
process, called the collapse of the wave function. This is yet another aspect of 
the quantum revolution that will be discussed in detail later in the course. 


1.4.6 The de Broglie wave function 


Although de Broglie’s ideas preceded the development of Schrödinger’s equation
and Born’s rule, it is convenient to call the wave function $\Psi(\mathbf{r}, t)$ for a free
particle a de Broglie wave function. We shall consider the case of a free particle
moving with momentum p and energy E in the positive x-direction. To simplify
the description, we ignore any motion in the y- and z-directions, and describe the
particle by a wave function $\Psi_{\text{dB}}(x, t)$. What would we guess for the form of this
wave function? At first sight, we might suppose that


$$\Psi_{\text{dB}}(x,t) = A\cos(kx - \omega t) \quad \text{(FIRST GUESS!)}, \tag{1.18}$$


as in Equation 1.8.


Remember that the subscript dB stands for ‘de Broglie’.


In fact, this first guess is not quite right since wave functions, including $\Psi_{\text{dB}}$, are
complex. Equation 1.18 is just the real part. Instead we have:


$$\Psi_{\text{dB}}(x,t) = A[\cos(kx - \omega t) + i\sin(kx - \omega t)] = A\exp[i(kx - \omega t)] = A e^{i(kx - \omega t)}. \tag{1.19}$$


One may use either of the equivalent expressions, $e^{i\,\text{whatever}}$ or
$\exp(i\,\text{whatever})$, according to typographical taste.


The constants k and $\omega$ that characterize the de Broglie wave can be related to
properties of the corresponding free particle. The wave number k is given by
$k = 2\pi/\lambda$. Combining this with the de Broglie relationship, $\lambda = h/p$, we see that


$$k = \frac{2\pi p}{h} = \frac{p}{\hbar}.$$


For light, we know that the angular frequency is related to the photon energy by
$E = \hbar\omega$. We shall assume that a similar relationship applies to de Broglie waves,
so that


$$\omega = \frac{E}{\hbar}.$$


Remember, $\hbar = h/2\pi$.


Because de Broglie waves describe free particles, with no potential energy, the
energy E can be identified with the kinetic energy, $E_{\text{kin}}$, of the particle. Thus:


The de Broglie wave function in Equation 1.19 represents a free particle of
momentum $\hbar k$ and kinetic energy $\hbar\omega$.


The constant A is complex, and may be written in exponential form as
$A = |A|e^{i\phi}$. This means that the de Broglie wave function can also be expressed
as


$$\Psi_{\text{dB}}(x,t) = |A|\,e^{i(kx - \omega t + \phi)}. \tag{1.20}$$
The following exercise shows that this wave function describes a particle that is
equally likely to have any value of x.


Exercise 1.8 Show that the wave function of Equation 1.19 represents a
particle that has the same probability of being found in any small interval of fixed
length $\delta x$, no matter where this interval is placed along the x-axis.


The wave function $\Psi_{\text{dB}}$ is clearly an idealization: we generally deal with
particles that are more likely to be found on Earth than in another galaxy! Even
so, idealizations have a role to play in physics. Later in the book you will see how
more localized wave functions describing free particles can be constructed, using
de Broglie waves as building blocks.
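
A numerical illustration of this uniformity, which does not replace the algebraic argument asked for in Exercise 1.8, is sketched below in Python. The momentum, the complex amplitude and the sample positions are illustrative assumptions, and the kinetic energy is taken to be the non-relativistic value $p^2/2m$ for an electron.

```python
import cmath

HBAR = 1.054571817e-34       # Planck's constant divided by 2 pi, J s
M_E = 9.1093837015e-31       # electron mass, kg

def de_broglie_wave(x, t, momentum, kinetic_energy, amplitude):
    """Equation 1.19 with k = p / hbar and omega = E_kin / hbar."""
    k = momentum / HBAR
    omega = kinetic_energy / HBAR
    return amplitude * cmath.exp(1j * (k * x - omega * t))

momentum = 1.0e-24                        # kg m/s (illustrative)
kinetic = momentum**2 / (2.0 * M_E)       # non-relativistic assumption
amplitude = 0.5 * cmath.exp(0.3j)         # complex A = |A| e^{i phi}

positions = [-1.0e-9, 0.0, 2.5e-10, 1.0e-6]     # metres
densities = [
    abs(de_broglie_wave(x, 1.0e-15, momentum, kinetic, amplitude)) ** 2
    for x in positions
]

for x, density in zip(positions, densities):
    print(f"x = {x:+.2e} m   |Psi|^2 = {density:.6f}")
```

Every position gives the same value, $|A|^2$, so the probability of finding the particle in an interval of fixed length does not depend on where the interval is placed.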


1.5 The origin of quantum interference 


This section takes another look at the diffraction and interference patterns 
produced by electrons and photons. This time, we will concentrate on describing 
such phenomena using the language of quantum physics. In the case of electron 
diffraction, this means interpreting the diffraction pattern in terms of the wave 
function. In the case of interference by photons, our description will involve a 
new concept called a probability amplitude. The section will also introduce a 
fundamental rule, which we shall call the interference rule, which lies at the heart 
of quantum physics. 


1.5.1 Electron interference caused by a double slit 


Let’s return to Figure 1.8c, which shows the interference pattern due to waves 
passing through a double slit. There is nothing mysterious about this pattern if one 
thinks of it in terms of water waves. At some places, the wave coming from one
slit tends to produce the same disturbance as the wave coming from the other
slit. At these points the waves reinforce one another and produce a maximum
(constructive interference). In the case of water waves, if you measured the 
strength of the wave by the up-and-down displacement of a floating cork from its 
mean position, then the displacement would be greatest at these points. At another 
series of points, the waves coming from the two slits would produce disturbances 
that cancel, and a minimum would be observed (destructive interference). In the 
case of water waves, a cork placed at one of these points would remain at rest. 
Notice two things. 


1. Nothing would change if the waves falling on the slits were very weak 
except that the periodic displacement of the cork would also be very weak. 


2. The displacement at any point is just the sum of the displacement that would 
be there if only one slit was open, plus the displacement that would be there 
if just the other slit was open. 


More precisely, if $D_1(t)$ is the displacement of the cork from its mean position at
time t due to a wave coming through slit 1, and $D_2(t)$ is the displacement of the
cork from its mean position at time t due to a wave coming through slit 2, then
when both slits are open the displacement is


$$\text{total displacement of cork at time } t = D_1(t) + D_2(t). \tag{1.21}$$
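
Equation 1.21 can be explored with a small Python sketch. It assumes, as a simplification not made in the text, that the waves from the two slits arrive at the cork with equal amplitudes, and it models each as a sinusoidal wave of the form in Equation 1.8 evaluated at the path length from the slit to the cork. The function names and values are hypothetical.

```python
import math

def slit_displacement(t, path_length, amplitude=1.0, wavelength=1.0, period=1.0):
    """Displacement at the cork due to the wave from one slit."""
    k = 2.0 * math.pi / wavelength
    omega = 2.0 * math.pi / period
    return amplitude * math.cos(k * path_length - omega * t)

def total_displacement(t, path_1, path_2):
    """Equation 1.21: the two displacements simply add."""
    return slit_displacement(t, path_1) + slit_displacement(t, path_2)

def largest_displacement(path_1, path_2, samples=1000):
    """Largest |displacement| of the cork over one period of the wave."""
    return max(
        abs(total_displacement(i / samples, path_1, path_2))
        for i in range(samples)
    )

# Path difference of one wavelength: constructive interference.
constructive = largest_displacement(10.0, 11.0)
# Path difference of half a wavelength: destructive interference.
destructive = largest_displacement(10.0, 10.5)

print(f"constructive: {constructive:.6f}")
print(f"destructive:  {destructive:.6f}")
```

With a path difference of a whole wavelength the cork moves with twice the amplitude of either wave alone; with a difference of half a wavelength it stays at rest.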




Now consider what happens when electrons fall on a pair of slits. The previous 
section described an experiment which simulated the effect of two slits by using a 
biprism (Figure 1.12), and the overall pattern produced by many electrons was 
shown in Figure 1.13d. This is very similar to the interference pattern produced 
by water waves, but there is one crucial difference. The electrons appear as 
indivisible particles, arriving all-or-nothing at points on the screen. This is unlike 
the behaviour of classical waves such as water waves. Water waves of very low 
intensity would produce a weak but continuous interference pattern that would 
cause a floating cork to oscillate everywhere except at the interference minima. 


The observation of discrete spots produced by individual electrons might lead you 
to think that the electrons travel from the slits to the screen as discrete particles, 
but this cannot be true either. It does not begin to explain why the pattern of spots 
follows the outline of an interference pattern. If the electrons travelled to the 
screen as particles, the number of electrons arriving at any given point on the 
screen would be the sum of those arriving from slit 1 and those arriving from
slit 2. Closing off one of the slits would always reduce the number of electrons
arriving at the screen. But this is not what is observed. Closing off one of the slits 
converts a two-slit interference pattern into a single-slit diffraction pattern. These 
two patterns have minima in different places, so a point that was a minimum for 
the two-slit interference pattern will no longer be a minimum for the single-slit 
diffraction pattern. Closing off one of the slits therefore causes an increase in the 
number of electrons arriving at this point. Such behaviour cannot be understood in 
terms of a particle model, but is explained by the interference of waves. This, of 
course, brings us back to the conclusion of a previous section — electrons 
propagate as waves and are detected as particles — they display wave—particle 
duality. 


How are these facts explained in quantum mechanics? Fortunately, the concept of 
a wave function, together with Born’s rule, allows us to make a consistent 
interpretation. The wave function propagates through space much like water 
waves, though there are some differences in detail. Water waves obey a certain 
partial differential equation while the wave function $\Psi$ obeys another (the
Schrödinger equation). Such differences do not matter here; the only important 
point is that the wave function propagates as a wave. The positions of the maxima 
and minima in its interference pattern depend on geometric factors, such as the 
spacing between the slits and their distance from the screen. However, the wave 
function itself is not directly measurable. There is no ‘bobbing cork’ that can be 
used to find the value of the complex wave function. Instead, we have Born’s rule, 
which tells us that the probability of finding an electron in a small region is 
proportional to the square of the modulus of the wave function in that region. 
Each electron appears as a discrete spot at a random location but, in the long run, 
the overall pattern of spots follows the probability distribution set by $|\Psi|^2$. In
this way, the wave function, and its interpretation, combine both wave- and
particle-like aspects.
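The way random individual spots build up a fixed overall pattern can be illustrated with a short simulation. The sketch below is only an illustration under stated assumptions: the screen distribution is taken to be an idealised $\cos^2$ fringe pattern (not the measured biprism pattern), and the names `two_slit_probability`, `sample_hits` and `histogram` are hypothetical helpers introduced here.

```python
import math
import random


def two_slit_probability(x, fringe_spacing=1.0):
    """Un-normalised |Psi|^2 on the screen (assumed idealised cos^2 fringes)."""
    return math.cos(math.pi * x / fringe_spacing) ** 2


def sample_hits(n, x_max=3.0, seed=0):
    """Draw n detection positions by rejection sampling from |Psi|^2."""
    rng = random.Random(seed)
    hits = []
    while len(hits) < n:
        x = rng.uniform(-x_max, x_max)           # candidate position on the screen
        if rng.random() < two_slit_probability(x):
            hits.append(x)                       # accept with probability |Psi|^2
    return hits


def histogram(hits, x_max=3.0, bins=60):
    """Count how many hits fall in each of `bins` equal-width intervals."""
    counts = [0] * bins
    width = 2 * x_max / bins
    for x in hits:
        index = min(int((x + x_max) / width), bins - 1)
        counts[index] += 1
    return counts


if __name__ == "__main__":
    for n in (50, 5000):
        counts = histogram(sample_hits(n), bins=30)
        peak = max(counts)
        print(f"{n} electrons:")
        for c in counts:
            print("#" * round(40 * c / peak))
```

With few electrons the printed histogram looks irregular; with many, the fringes set by the assumed $|\Psi|^2$ emerge, even though each individual position is random.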


To give a more detailed account of electron interference, let us denote by $\Psi(\mathbf{r}, t)$
the wave function that emerges from the two slits. It turns out that this wave
function can be expressed as the sum of two terms:

$$\Psi(\mathbf{r}, t) = \Psi_1(\mathbf{r}, t) + \Psi_2(\mathbf{r}, t), \tag{1.22}$$

where $\Psi_1(\mathbf{r}, t)$ is the part of the wave function associated with propagation
through slit 1 and $\Psi_2(\mathbf{r}, t)$ is the part of the wave function associated with
propagation through slit 2. The terms are scaled in such a way that the complete
wave function, $\Psi$, describes one electron that passes through either of the slits,
although we cannot say which one.


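It follows from Born’s rule that the probability of finding an electron in a small
volume element $\delta V$, located at $\mathbf{r}$, is

$$\text{probability} = |\Psi(\mathbf{r}, t)|^2\,\delta V = |\Psi_1(\mathbf{r}, t) + \Psi_2(\mathbf{r}, t)|^2\,\delta V. \tag{1.23}$$

Omitting the arguments of the functions for clarity, this can be rewritten as

$$\begin{aligned}
\text{probability} &= (\Psi_1^* + \Psi_2^*)(\Psi_1 + \Psi_2)\,\delta V \\
&= (\Psi_1^*\Psi_1 + \Psi_2^*\Psi_2 + \Psi_1^*\Psi_2 + \Psi_2^*\Psi_1)\,\delta V \\
&= (|\Psi_1|^2 + |\Psi_2|^2 + \Psi_1^*\Psi_2 + \Psi_2^*\Psi_1)\,\delta V.
\end{aligned}$$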


If we had squared the moduli first, and then added the squares, we would have
obtained $|\Psi_1|^2 + |\Psi_2|^2$, which represents the sum of two single-slit diffraction
patterns. By adding the wave functions first, and then taking the square of
the modulus of the sum, we get an additional term $(\Psi_1^*\Psi_2 + \Psi_2^*\Psi_1)$, known
as the interference term. The interference term can be positive or negative,
corresponding to constructive or destructive interference. When it is included, the
contributions from the two slits cancel at points where $\Psi_1(\mathbf{r}, t)$ and $\Psi_2(\mathbf{r}, t)$ have
the same magnitude and opposite signs. This is like the water waves from two
slits cancelling in Equation 1.21, except that here we are dealing with a wave
function that represents the state of a single electron. In summary, we can make
the following statement:
the following statement: 


To allow for the interference of two terms in a wave function, it is essential 
to add the terms before taking the square of the modulus. 
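The difference between the two orders of operation can be checked numerically. The following is a minimal sketch, assuming two complex values of equal, arbitrarily chosen magnitude 0.5 standing in for $\Psi_1$ and $\Psi_2$ at a single point; the function name `probabilities` is hypothetical.

```python
import cmath


def probabilities(psi1, psi2):
    """Return (|psi1 + psi2|^2, |psi1|^2 + |psi2|^2, interference term)."""
    added_first = abs(psi1 + psi2) ** 2                 # add, then square the modulus
    squared_first = abs(psi1) ** 2 + abs(psi2) ** 2     # square the moduli, then add
    # psi1* psi2 + psi2* psi1 is always real; .real discards rounding noise
    interference = (psi1.conjugate() * psi2 + psi2.conjugate() * psi1).real
    return added_first, squared_first, interference


if __name__ == "__main__":
    magnitude = 0.5  # assumed common magnitude, for illustration only
    for phase in (0.0, cmath.pi / 2, cmath.pi):
        psi1 = complex(magnitude, 0.0)
        psi2 = cmath.rect(magnitude, phase)   # same magnitude, relative phase `phase`
        added, squared, term = probabilities(psi1, psi2)
        print(f"phase={phase:.3f}  added first={added:.3f}  "
              f"squared first={squared:.3f}  interference term={term:.3f}")
```

In every case the first value equals the second plus the interference term; only the first can drop to zero, which happens when the relative phase is $\pi$.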


Exercise 1.9 Consider two functions: $f(x) = \sin x$ and $g(x) = \cos x$. Which
of the functions $p(x) = |f(x) + g(x)|^2$ and $q(x) = |f(x)|^2 + |g(x)|^2$ is zero for
certain values of $x$ and which is never zero?


In the case of two-slit electron interference, the presence of interference maxima 
and minima implies that waves come from both slits. These waves describe a 
single electron, so we are unable to say which slit the electron passed through. In 
other words, we must abandon the classical idea of a trajectory. An electron 
might originate within a source, be accelerated towards the pair of slits, and end 
up being detected in a particular pixel, but we absolutely cannot say that it follows 
a particular path from the source to the place where it was detected. We cannot 
say which slit it went through. 


If we did somehow determine which slit the electron went through, and this left a
record of some kind (referred to as ‘which-way’ information), it would be wrong
to add the two wave functions before taking the square of their modulus. If an
electron is known to pass through slit 1, its wave function will be proportional to
$\Psi_1$, and there will be no interference with $\Psi_2$. The pattern produced by such
electrons is that for a single slit: the two-slit interference pattern is destroyed
when information is collected about which slit the electron passes through. This
has been verified experimentally.


Abandoning trajectories might make you feel uneasy. How do we reconcile it with 
the fact that electrons and other subatomic particles sometimes do appear to have 
trajectories, as revealed by cloud-chamber tracks (Figure 1.15)? The answer is 
that the cloud chamber provides which-way information. If a charged particle 
emerged from a double slit into a cloud chamber, you would see just one track and 
certainly not a two-slit interference pattern. As soon as the particle enters the 
chamber and induces a water droplet to form, the wave function collapses. The 
water droplet records which slit the particle passed through and this ‘decides’ the 
direction of motion of the particle. The particle maintains substantially the same 
direction of motion in the medium until it slows down or is deflected by a close 
encounter with a nucleus. 


Figure 1.15 Alpha-particle 
tracks in a cloud chamber. 
Each track was made by a 
single alpha particle from a 
radioactive source, which 
ionizes a trail of molecules in 
air containing supersaturated 
water vapour. The ions 
become seeds upon which 
water droplets form. 


1.5.2 Probability amplitudes 


Our interpretation of electron diffraction in terms of wave functions can be
used to describe other processes, but there are some limitations. For example,
Schrödinger’s equation applies to non-relativistic material particles, such as
electrons, but it does not apply to photons. Fortunately, there is another way of
calculating probabilities, but to explain it we need to introduce a new concept, that
of a probability amplitude.

(Margin note: The term probability amplitude is part of the language of
quantum physics. It should not be confused with the amplitude of a wave.)


You will recall that Born’s rule tells us that the probability of finding a particle in 
a given region is proportional to the square of the modulus of the wave function 
in that region. The idea of calculating a probability by taking the square of 

the modulus of something is deeply embedded in quantum mechanics. The 
‘something’ we need to take the modulus squared of is called a probability 
amplitude. So, we have: 


$$\text{probability} = |\text{probability amplitude}|^2.$$

More formally, if a process has a number of possible outcomes, and we associate
a probability amplitude $a_i$ with outcome $i$, the probability of this outcome is $|a_i|^2$.


Probability amplitudes can be used in situations where wave functions are difficult
to use. If we wished to calculate the decay probability $\lambda$ for a nucleus, we would
first calculate a probability amplitude $a$, and then calculate $\lambda = |a|^2$. When
photons are reflected from half-silvered mirrors, the probability amplitudes for
going through the mirror and for being reflected both have magnitudes equal to
$1/\sqrt{2}$, so the corresponding probabilities are both equal to 1/2.

(Margin note: The adjustment of mirror M1 is very sensitive: changing the
difference in path length by half a wavelength changes constructive interference to
destructive interference.)


The rule for adding wave functions before taking the square of their modulus has 
its counterpart for probability amplitudes. In fact, we have the following general 
rule, which we shall informally call the interference rule. 


The interference rule 


If a given process, leading from an initial state to a final state, can proceed in 
two or more alternative ways, and no information is available about which 
way is followed, the probability amplitude for the process is the sum of the 
probability amplitudes for the different ways. 


When calculating the probability of the process, it is essential to add all the 
contributing probability amplitudes before taking the square of the modulus. 
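The interference rule can be written as a short function. This is a sketch only: the function name `probability` and the flag `which_way_known` are hypothetical, and the branch that adds probabilities when the route is recorded follows the earlier discussion of which-way information rather than the boxed rule itself.

```python
def probability(amplitudes, which_way_known=False):
    """Probability of a process that can proceed by several alternative ways.

    amplitudes: one complex probability amplitude per alternative way.
    which_way_known: True if a record exists of which way was followed.
    """
    if which_way_known:
        # No interference: square each modulus, then add the probabilities.
        return sum(abs(a) ** 2 for a in amplitudes)
    # Interference rule: add all the amplitudes, then square the modulus.
    return abs(sum(amplitudes)) ** 2


if __name__ == "__main__":
    ways = [0.5, -0.5]  # assumed amplitudes: equal magnitude, opposite sign
    print(probability(ways))                        # complete cancellation
    print(probability(ways, which_way_known=True))  # no cancellation
```

The two amplitudes used here are arbitrary illustrative values; with them the unrecorded process never occurs, while the recorded one occurs with probability 0.5.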


In the next section, you will see how this rule helps us to explain the interference 
of photons in the laboratory. 


1.5.3 Photon interference in the laboratory 


Figure 1.16 Light enters from the left, striking a half-silvered mirror, H1. It
can then take two possible paths P1 and P2, being reflected off mirrors M1
(adjustable) and M2 before passing through a second half-silvered mirror, H2, to
be detected at either of the detectors Da or Db.


In Section 1.2.2 we saw a half-silvered mirror in action, producing random 
numbers. But the fun with half-silvered mirrors really starts with two of them and 
they provide a nice example of the principle of adding probability amplitudes 
before you square. As shown in Figure 1.16, a Mach-Zehnder interferometer is 
an arrangement of two half-silvered mirrors, H1 and H2, which act as beam 
splitters and two fully-reflecting mirrors, M1 and M2. This figure shows the two 
possible light paths, P1 and P2, leading from H1 to H2. Note that paths P1 and P2 
do not have exactly identical lengths, and their relative length can be adjusted by 
making tiny adjustments to the position of mirror M1. It would be natural to 
suppose that, when light following both paths reaches H2, half the light is detected 
at one detector Da, and half is detected at the other detector Db. Indeed, this is
exactly what happens when either one of the paths is blocked. If only path P1 is
open, then the light on this path is split by H2 to be detected at detector Da or Db, 
with half going each way. The same thing happens when only path P2 is open. 


With paths P1 and P2 both open, we find that, as the length of path P1 (say) is 
continuously adjusted, the light intensity measured at Da oscillates from zero to a 
maximum value that corresponds to the detection of all incident photons. The 
light measured at Db does the same, but in such a way that the maxima occur for 
the same path lengths that give minima at Da. This behaviour can be understood 
for classical light as an interference effect. If the two path lengths are such that the 
waves coming via P1 and P2 are exactly in phase (wave maxima matching) when 
emerging towards Da then the two waves reinforce one another and all the light is 
detected at Da. If this happens, there will be no signal at Db. All intermediate 
cases are covered by varying one of the path lengths, as shown in Figure 1.17. 


Figure 1.17 Light signals at detectors Da and Db vary as shown when the phase
difference between the waves travelling along P1 and P2 is continuously changed
by varying a path length. The maxima in one counter coincide with minima in the
other. The dots indicate the statistical nature of the counts in both counters.
(Experimental results due to Grangier et al., 1986.)


With a weak light source sending just one photon into the interferometer at a time, 
exactly the behaviour predicted by the classical wave picture occurs if the photons 
are counted for long enough. That is how Figure 1.17 was obtained, with each 
detector Da and Db counting individual photons. So let us now turn to the 
quantum description in terms of probability amplitudes. 


Each photon has some probability amplitude $a_1$ for reaching detector Da via P1,
and a probability amplitude $a_2$ for reaching the same detector via P2. These
depend on the lengths of paths P1 and P2 respectively. Since there is no way of
determining which path was taken by the photon, we must use the interference
rule for probability amplitudes: we add the probability amplitudes before squaring
the modulus. The total probability of detection at Da is therefore

$$p_a = |a_1 + a_2|^2. \tag{1.24}$$

In general $a_1 = |a_1|e^{i\theta_1}$ and $a_2 = |a_2|e^{i\theta_2}$ are complex numbers. If the path
lengths are such that $\theta_1 = \theta_2$, these probability amplitudes add constructively and
$p_a$ is a maximum; if $\theta_1 = \theta_2 + \pi$, the probability amplitudes add destructively and
$p_a$ is a minimum.


A very similar description applies at the other detector. Let the probability
amplitudes for being detected at Db via P1 be $b_1$ and via P2 be $b_2$. Then the
probability of detection at Db is

$$p_b = |b_1 + b_2|^2.$$

This turns out to be small when $p_a$ is large, and vice versa, as shown in
Figure 1.17.
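The complementary behaviour of the two detectors can be reproduced with a small calculation. The sketch below rests on assumptions that go beyond the text: every route amplitude is given magnitude 1/2 (a factor $1/\sqrt{2}$ from each half-silvered mirror), and $b_2$ is given a relative minus sign so that the two probabilities add up to one. The function name `detection_probabilities` is hypothetical.

```python
import cmath
import math


def detection_probabilities(theta1, theta2):
    """Return (p_a, p_b) for path phases theta1 (via P1) and theta2 (via P2)."""
    a1 = 0.5 * cmath.exp(1j * theta1)    # reach Da via P1
    a2 = 0.5 * cmath.exp(1j * theta2)    # reach Da via P2
    b1 = 0.5 * cmath.exp(1j * theta1)    # reach Db via P1
    b2 = -0.5 * cmath.exp(1j * theta2)   # reach Db via P2 (assumed sign convention)
    p_a = abs(a1 + a2) ** 2              # add amplitudes, then square the modulus
    p_b = abs(b1 + b2) ** 2
    return p_a, p_b


if __name__ == "__main__":
    for step in range(9):
        phase_difference = step * math.pi / 4
        p_a, p_b = detection_probabilities(phase_difference, 0.0)
        print(f"phase difference = {phase_difference:5.2f}  "
              f"p_a = {p_a:.3f}  p_b = {p_b:.3f}")
```

Under these assumptions $p_a$ and $p_b$ oscillate in antiphase as the phase difference is varied, which is the qualitative behaviour shown in Figure 1.17.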


Exercise 1.10 Assume that the path lengths are such that $p_b = 0$. What will
be found at Db if, in an effort to determine which path a photon takes, path P1 is
blocked? What will $p_b$ be in this case?


Exercise 1.11 Imagine that a Mach-Zehnder interferometer has one path
blocked, and an incident photon can appear at either of the detectors. Explain why
un-blocking the blocked path can, if the path lengths are appropriate, stop photons
being detected by one of the two detectors. Explain both for classical waves and
for photons entering the interferometer one at a time.


We now have a more satisfactory way of speaking about particles that encounter a 
beam splitter or a double slit. Instead of saying rather vaguely, as we did in 
Section 1.2.2, that ‘... each photon in some sense actually goes both straight 
through and is reflected at 90°...’ we can say that the photon is in a superposition 
of two states: one in which it travels one way, and one in which it travels the other. 
If this still sounds mysterious, it is because it matches nothing in our everyday 
experience of the world. One could say of superposition, like other key quantum 
notions, that it is really more a matter of getting used to it than of understanding it 
... especially if, by ‘understanding’, you mean having a mental model involving 
everyday things that exactly parallels what is going on at the quantum level. This 
course will give you plenty of opportunity to get used to the idea of quantum 
superposition. 


1.6 Quantum physics: its scope and mystery 


Quantum physics is the foundation of our understanding of the material world 
around us, of the energy source of stars, and of the existence of galaxies. Why, for
example, is copper reddish and gold yellowish? The answer lies in the way
electrons in metals obey quantum rules, and the way light interacts with these 
electrons according to quantum rules. 


When do we need quantum physics? There are many phenomena where classical 
physics gives practically the right answers. For example, we can explain the 
reflection of radio waves from a sheet of copper using classical physics, but 
quantum physics is required to explain why light reflected from copper appears 
reddish and why gold has its characteristic lustre. A photon of visible light has 
much more energy, E = hf, than a radio-wave photon, so many more photons are 
involved when a given amount of energy is transferred with radio waves than with 
light. In general, the more photons that are involved in a process, the less a 
quantum description is required. For example, quantum effects are not important 
for designing radio antennas, but quantum effects are important in the interactions 
of gamma rays. The emission by a nucleus of a single gamma-ray photon, and its 
interaction with matter, are entirely quantum phenomena, the gamma-ray photon 
having a frequency billions of times higher than an AM radio-wave photon. 
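The comparison of photon numbers follows directly from $E = hf$. The sketch below is illustrative only: the three frequencies are assumed round values chosen here (they are not quoted in the text), and the function names are hypothetical.

```python
PLANCK_CONSTANT = 6.62607015e-34  # J s (exact value in the SI)


def photon_energy(frequency_hz):
    """Energy in joules of one photon of the given frequency, E = h f."""
    return PLANCK_CONSTANT * frequency_hz


def photons_per_joule(frequency_hz):
    """Number of photons needed to transfer one joule at this frequency."""
    return 1.0 / photon_energy(frequency_hz)


# Assumed, illustrative frequencies only.
ILLUSTRATIVE_FREQUENCIES_HZ = {
    "AM radio": 1.0e6,
    "visible light": 5.0e14,
    "gamma ray": 1.0e20,
}

if __name__ == "__main__":
    for name, frequency in ILLUSTRATIVE_FREQUENCIES_HZ.items():
        print(f"{name:14s} E = {photon_energy(frequency):.3e} J, "
              f"photons per joule = {photons_per_joule(frequency):.3e}")
```

The lower the frequency, the more photons are needed to carry a given energy, which is why a classical description works well for radio waves.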


It is important to realize that quantum mechanics, rather than classical mechanics, 
is the fundamental theory. In principle quantum mechanics itself can give an 
account of where and when classical concepts are valid. Marconi did not need 
quantum mechanics, and quantum mechanics itself tells us why. But the devices 
within a modern radio depend on quantum mechanics in an essential way, and the 
reason for that is understood too. 


At the heart of the strange new quantum world are wave functions and probability 
amplitudes that, for alternative (but unrecorded) routes, must be added before 
taking the square of the modulus. From these, probabilities can be calculated; we 
simply must face the fact that Nature is not deterministic. In Chapter 2 we begin 
the story of how wave functions and probability amplitudes are calculated for a 
wide range of cases. 


If you find what you have just seen profoundly strange, even disturbing, then that 
is an entirely appropriate response! I end with the words of one of the principal 
creators of quantum mechanics, Werner Heisenberg: 


‘I remember discussions with Bohr which went on through many hours 
till very late at night and ended almost in despair; and when at the end of the 
discussion I went alone for a walk in the neighbouring park I repeated to 
myself again and again the question: Can nature possibly be as absurd as it 
seemed to us in these atomic experiments?’ 

Physics and Philosophy, 1958 


Summary of Chapter 1


Section 1 Microscopic systems on the scale of molecules, atoms, nuclei and 
subnuclear particles have discrete energy levels. Transitions between these energy 
levels involving the emission or absorption of photons are a prime source of 
information concerning the structure of matter. Systems such as ionized atoms 
have a continuum of energy levels above the highest discrete level of the bound 
system. 


Section 2 The world is not deterministic; observations involving photons, 
electrons, etc. are probabilistic. Radioactive decay provides an example: identical 
nuclei of a specific kind have a well-defined probability of decaying, but the 
instant of decay of an individual nucleus is unpredictable. A practical application 
of quantum indeterminism is the quantum random number generator based on 
the random appearance of photons on either side of a half-silvered mirror. A 
characteristic feature of the quantum world, which reappears in every chapter in 
different forms, is this: quantum physics predicts a fixed overall pattern of some 
kind, but individual events occur randomly within that pattern. 


Section 3 Light exhibits ‘wave–particle duality’, i.e. it exhibits both wave-like
and particle-like behaviour. Photons are not simply packets of electromagnetic 
radiation, but have momentum and energy, and are detected in localized regions. 


Section 4 Electrons and indeed all material particles also exhibit wave–particle
duality, displaying either wave or particle aspects depending on the type of
experiment to which they are subjected. The de Broglie wavelength is related to
momentum by $\lambda = h/p$. The propagation of a particle as a wave is described by a
complex-valued wave function which obeys an equation called Schrödinger’s
equation. In one dimension, the wave function is usually represented by $\Psi(x, t)$.
Its significance is understood as follows: the probability of detecting a particle at
time $t$ in a small interval $\delta x$, centred on position $x$, is $|\Psi(x, t)|^2\,\delta x$.


Section 5 To allow for the interference of two terms in a wave function, it is 
essential to add the two terms before taking the square of the modulus. More 
generally, squaring the modulus of a probability amplitude gives a probability. 
The interference rule states that when a given process can happen in two or more 
alternative (but unrecorded) ways, the probability amplitudes for the different 
ways must be added before taking the square of the modulus. 


Section 6 Quantum physics has fundamentally changed our understanding of 
the world. Do not be surprised if quantum mechanics seems shocking; its creators 
found it so! 


Achievements from Chapter 1


After studying this chapter, you should be able to: 


1.1 Explain the meanings of the newly defined (emboldened) terms and 
symbols, and use them appropriately. 


1.2 Give a general account of the way quantum physics has changed our 
understanding of the world. 


1.3 Write an account of, and answer questions concerning, the existence and 
significance of discrete energy levels in atoms and other microscopic 
systems. 


1.4 Explain what is meant by quantum indeterminism, with reference to real 
examples. 


1.5 Explain what is meant by wave—particle duality, drawing on experiments 
involving photons and material particles such as electrons. 


1.6 Explain how quantum physics accounts for wave—particle duality, using 
wave functions and probability amplitudes. 


1.7 State and use the de Broglie relationship for free particles. 
1.8 State and use Born’s rule for the wave function. 
1.9 State and use the interference rule. 


1.10 Explain when it is appropriate to add wave functions or probability 
amplitudes before squaring the modulus, and when it is not appropriate. 


1.11 Explain why the concept of a particle trajectory cannot be taken for granted 
in quantum mechanics. 


After studying Chapter 8.1 you should also be able to: 


1.12 Manipulate complex numbers, recognizing the significance of modulus and 
phase and appreciating their relationship to the real and imaginary parts.


Chapter 2 Schrödinger’s equation
and wave functions


Introduction 


The story is told that the physicist Peter Debye attended a talk given by de Broglie
concerning his electron waves. Debye asked de Broglie what equation his waves
obeyed, without receiving a satisfactory response. In the audience was Erwin
Schrödinger ... and the rest is history! In the first half of 1926, Schrödinger
(Figure 2.1) presented his famous wave equation which provides the basis for the
wave-mechanical approach to explaining and predicting quantum phenomena.


Figure 2.1 Erwin Schrödinger (1887–1961), who was awarded the Nobel prize for
physics in 1933.


Let us try to understand the problem Schrödinger was trying to solve. In Chapter 1
we saw that the propagation of an electron through a slit and on to a detecting
screen is described by a time-dependent wave function, $\Psi(\mathbf{r}, t)$. The wave
function contains information about where the electron is likely to be found. At
time $t$, the probability of finding the electron in any small region centred on a
position $\mathbf{r}$ is proportional to the value of $|\Psi(\mathbf{r}, t)|^2$.


As time passes, the wave function changes, and the region where the electron is 
likely to be found also changes — in ordinary speech, we say that the electron 
moves. In order to understand this motion, we must know how the wave function 
varies in time at each point in space. Such a problem is like many others in 
physics. It is like predicting how water waves, sound waves or light waves 
propagate through space. In each case, there is a particular partial differential 
equation that links a rate of change in time to a rate of change in space — a wave 
equation. We need the wave equation that governs the dynamics of the quantum 
wave function: this equation is called Schrödinger’s equation, and it is the main
subject of this chapter. 


A second major problem must also be tackled. You saw in Chapter 1 that systems 
have definite energy levels, and that transitions between these energy levels are 
accompanied by the emission of photons with characteristic frequencies. The 
resulting spectra can be very intricate and provide ‘fingerprints’ of nuclei, atoms 
and molecules. Of course, we would like to know where these energy levels come 
from. 


Rather surprisingly, the second problem is closely related to the first. This is
because Schrödinger’s equation has a special set of solutions describing what
are called stationary states. Each stationary state has a definite energy, and
corresponds to an energy level of the system. It turns out that stationary-state
wave functions have a very simple time-dependence; they describe standing waves
that oscillate without going anywhere, rather like the vibrations of a plucked
guitar string. Allowing for this fact, Schrödinger’s equation leads to a simpler
condition, called the time-independent Schrödinger equation, which determines
the spatial variation of the standing wave. In practice, the energy levels of a
quantum system are determined by solving the time-independent Schrödinger
equation, and you will see many examples of this throughout the course. One
can imagine Schrödinger’s thoughts when he solved the time-independent
Schrödinger equation for a system of one proton and one electron and found the
energy levels of the hydrogen atom. Although Bohr had calculated these with a
semi-quantum model, everyone knew that Bohr’s model was far from the whole
truth: it could not even be generalized to helium, the next simplest atom. Within
months of Schrödinger’s success, Heisenberg had used the time-independent
Schrödinger equation to explain the helium spectrum for the first time, and the
assumptions behind Schrödinger’s ideas were vindicated.


This chapter introduces Schrödinger’s equation and the associated
time-independent Schrödinger equation. We will not derive Schrödinger’s
equation here, but we will show that this equation is plausible, with its roots in
Newtonian mechanics. Ultimately, Schrödinger’s equation is justified by the fact
that it predicts the result of experiments.


Section 2.1 begins by considering free particles, which are described by de
Broglie waves. It is possible to guess the form of Schrödinger’s equation in this
case, but it is not obvious how to generalize the result to include the effects of
forces. In Section 2.2 we introduce a number of mathematical concepts, including
that of a linear operator, which will help us to make this transition. In Section 2.3,
we set out the three steps that lead to Schrödinger’s equation for any system that is
understood in classical terms. Section 2.4 then describes some general properties
of Schrödinger’s equation and its solutions. Finally, Section 2.5 introduces the
time-independent Schrödinger equation which determines the energy levels of a
quantum system.


The Mathematical toolkit reviews differential equations (in Section 8.2) and 
partial differential equations (in Section 8.3). You may find it helpful to treat 
this material as an essential part of your study of this chapter. 


2.1 Schrödinger’s equation for free particles


We start with the simplest case: a free particle (that is, a particle free from any
forces) moving in one dimension, in the positive $x$-direction. In Chapter 1 you
saw that such a particle is described by the de Broglie wave function

$$\Psi_{\rm dB}(x, t) = A e^{i(kx - \omega t)}, \tag{2.1}$$

where $A$ is a complex constant. You also saw that this wave function corresponds
to a free particle with a momentum of magnitude

$$p = \frac{h}{\lambda} = \hbar k \tag{2.2}$$

and an energy

$$E = hf = \hbar\omega. \tag{2.3}$$


Although $\Psi_{\rm dB}$ gives a complete description of a de Broglie wave, it is helpful to
take a step backwards, and ask what partial differential equation this wave
function satisfies. This will give us a valuable clue for writing down Schrödinger’s
equation in more complicated situations.


We shall simply write down a suitable partial differential equation and check that
$\Psi_{\rm dB}$ in Equation 2.1 satisfies it. The appropriate equation is

$$i\hbar \frac{\partial \Psi_{\rm dB}}{\partial t} = -\frac{\hbar^2}{2m} \frac{\partial^2 \Psi_{\rm dB}}{\partial x^2}. \tag{2.4}$$


It is not difficult to see that $\Psi_{\rm dB}$ satisfies this equation provided that a certain
relationship between $\omega$ and $k$ is satisfied. The following worked example gives
the details.


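Worked Example 2.1
(Essential skill: verifying that a given function satisfies a given partial
differential equation)

Verify that the de Broglie wave function $\Psi_{\rm dB}(x, t) = A e^{i(kx - \omega t)}$ satisfies
Equation 2.4, provided that $\omega$ and $k$ obey the condition $\hbar\omega = (\hbar k)^2/2m$.


Solution
To answer a question like this, we simply insert the given function into both
sides of the partial differential equation. Substituting Equation 2.1 into the
left-hand side of Equation 2.4 gives

$$i\hbar \frac{\partial \Psi_{\rm dB}}{\partial t} = i\hbar \frac{\partial}{\partial t}\left(A e^{i(kx - \omega t)}\right) = i\hbar\left(-i\omega A e^{i(kx - \omega t)}\right) = \hbar\omega\,\Psi_{\rm dB}(x, t).$$

Similarly, substituting it into the right-hand side gives

$$-\frac{\hbar^2}{2m} \frac{\partial^2 \Psi_{\rm dB}}{\partial x^2} = -\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2}\left(A e^{i(kx - \omega t)}\right) = -\frac{\hbar^2}{2m}(ik)^2\,\Psi_{\rm dB}(x, t) = \frac{(\hbar k)^2}{2m}\,\Psi_{\rm dB}(x, t).$$

Equation 2.4 is satisfied if these two expressions are equal for all $x$ and $t$.
This is true if

$$\hbar\omega = \frac{(\hbar k)^2}{2m}, \tag{2.5}$$

which is the condition specified in the question.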
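The same check can be made numerically. The sketch below is not part of the worked example: it assumes units in which $\hbar = m = 1$, approximates the derivatives by central finite differences with an arbitrarily chosen step, and uses hypothetical helper names (`psi_db`, `omega_for`, `residual`).

```python
import cmath

HBAR = 1.0  # assumed units in which hbar = 1
MASS = 1.0  # assumed units in which m = 1


def psi_db(x, t, k, omega, amplitude=1.0):
    """De Broglie wave function A exp(i(kx - omega t))."""
    return amplitude * cmath.exp(1j * (k * x - omega * t))


def omega_for(k):
    """Angular frequency satisfying hbar omega = (hbar k)^2 / 2m (Equation 2.5)."""
    return HBAR * k**2 / (2 * MASS)


def residual(x, t, k, omega, h=1e-3):
    """|LHS - RHS| of Equation 2.4, with derivatives taken by central differences."""
    dpsi_dt = (psi_db(x, t + h, k, omega) - psi_db(x, t - h, k, omega)) / (2 * h)
    d2psi_dx2 = (psi_db(x + h, t, k, omega)
                 - 2 * psi_db(x, t, k, omega)
                 + psi_db(x - h, t, k, omega)) / h**2
    lhs = 1j * HBAR * dpsi_dt
    rhs = -(HBAR**2) / (2 * MASS) * d2psi_dx2
    return abs(lhs - rhs)


if __name__ == "__main__":
    k = 2.0
    print("condition satisfied:", residual(0.3, 0.7, k, omega_for(k)))
    print("condition violated: ", residual(0.3, 0.7, k, 3.0))
```

When $\omega$ obeys Equation 2.5 the residual is tiny (limited only by the finite-difference step); for any other $\omega$ it is of order $|A|$.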


The relationship that $\omega$ and $k$ must obey, Equation 2.5, is highly significant.
Using the de Broglie relationship $p = \hbar k$, and the Newtonian expression for the
magnitude of momentum, $p = mv$, the right-hand side of Equation 2.5 becomes

$$\frac{(\hbar k)^2}{2m} = \frac{p^2}{2m} = \frac{(mv)^2}{2m} = \tfrac{1}{2}mv^2, \tag{2.6}$$

which is the kinetic energy of the particle. Since we are dealing with a de Broglie
wave, describing a free particle that experiences no forces, the potential energy
may be taken to be zero. It follows that $(\hbar k)^2/2m$ can be interpreted as the energy
of the particle, $E$, and we know from Equation 2.3 that this is equal to $\hbar\omega$.
Consequently, Equation 2.5 is automatically satisfied, and we see that $\Psi_{\rm dB}$ is a
solution of Equation 2.4.


Let’s take stock. We are looking for a partial differential equation that governs the
dynamics of the wave function. We have seen that Equation 2.4 accomplishes this
for $\Psi_{\rm dB}$. To avoid clutter, we now write this equation without the subscript:

$$i\hbar \frac{\partial \Psi}{\partial t} = -\frac{\hbar^2}{2m} \frac{\partial^2 \Psi}{\partial x^2} \quad \text{(for a free particle)}. \tag{2.7}$$


In fact, this is Schrödinger’s equation for a free particle with zero potential energy.

(Margin note: The special case $\alpha = 1$ gives the identity operator, written I.)

It is worth noting two points:


1. Equation 2.7 contains the first partial derivative OW /Ot on the left-hand side. 
This has an important consequence: if we know the wave function 
everywhere at time t, we can evaluate the right-hand side of Equation 2.7, 
and hence find OW /0t, the rate of change of the wave function at time t. 
This allows us to find W at a slightly later time t + ôt and, continuing the 
process, to find Y at any later time. In other words, future values of the wave 
function are fully determined by its values now, and by the form of 
Schrédinger’s equation. 


2. Equation 2.7 involves the imaginary number i = √(−1). It may seem unusual
for a fundamental equation in physics to involve complex numbers in this
way. In electromagnetism, for example, complex numbers appear for
convenience, but in quantum physics they are essential. Indeed, you have
already seen that the wave function is a complex quantity.
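Point 1 can be made concrete with a small numerical sketch: given samples of Ψ at one time, Equation 2.7 supplies ∂Ψ/∂t, and a short step δt then gives Ψ slightly later. The periodic grid, the units (ħ = m = 1) and the simple one-step update are assumptions made purely for illustration; this crude update is not suitable for long simulations.

```python
import cmath
import math

HBAR = 1.0  # assumed units
MASS = 1.0


def step_free_particle(psi, dx, dt):
    """Advance samples of Psi on a periodic grid by one small time step dt.

    Uses dPsi/dt = (i*hbar/2m) d2Psi/dx2 (Equation 2.7 solved for dPsi/dt),
    with a central-difference estimate of the second derivative.
    """
    n = len(psi)
    new_psi = []
    for j in range(n):
        d2 = (psi[(j + 1) % n] - 2.0 * psi[j] + psi[j - 1]) / dx**2
        dpsi_dt = (1j * HBAR / (2.0 * MASS)) * d2
        new_psi.append(psi[j] + dt * dpsi_dt)
    return new_psi


# One full wavelength of a de Broglie wave sampled on 200 points.
n = 200
dx = 1.0 / n
k = 2.0 * math.pi
omega = HBAR * k**2 / (2.0 * MASS)
dt = 1e-5

psi0 = [cmath.exp(1j * k * j * dx) for j in range(n)]
psi1 = step_free_particle(psi0, dx, dt)
exact = [cmath.exp(1j * (k * j * dx - omega * dt)) for j in range(n)]

# Largest difference between the stepped wave and the exact wave at time dt.
max_error = max(abs(a - b) for a, b in zip(psi1, exact))
# For comparison: how far the exact wave has moved away from its initial values.
change = max(abs(a - b) for a, b in zip(psi0, exact))
print(max_error, change)
```

The stepped wave function agrees with the exact de Broglie wave far more closely than the initial wave function does, which is the sense in which the present values determine the future ones.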


So far, so good, but not all particles are free. In realistic situations, such as an 
electron and a proton in a hydrogen atom, the particles are subject to forces. The 
main task in the first half of this chapter is to find an equation, more general than 
Equation 2.7, that governs the time-dependence of the wave function for a particle 
that is not free from forces. 


2.2 Mathematical preliminaries 


The task of generalizing Equation 2.7 is not trivial and involves some bold
leaps of faith. This section will prepare the way by introducing three crucial
mathematical concepts: operators, linear operators and eigenvalue equations. For
the moment, these will be presented as pure mathematics, but you will soon see
how these concepts are woven deeply into the language of wave mechanics.


2.2.1 Operators 


We first introduce the notion of an operator acting on a function. For present 
purposes, we can say that 


An operator is a mathematical entity that transforms one function into 
another function. 


An example that will occur many times in this course is the differential operator
d/dx that transforms the function f(x) into its derivative df/dx. For example,
5x is transformed into the constant function 5, and 3x³ + sin(4x) is transformed
into 9x² + 4 cos(4x). When dealing with functions of more than one variable, we
shall also encounter operators like ∂/∂x and ∂²/∂x² which take the first and
second partial derivatives (with respect to x) of any function that they act on.


Two other operators are very straightforward. One is simply multiplying a
function by a constant: if α is a number, then multiplication by α can be
thought of as an operator. Applied to a function f(x), it gives a new function
g(x) = αf(x), which is just α times f(x). The other simple operator is
multiplication by a function G(x). Applying this to a function f(x) produces a
new function g(x) = G(x)f(x). This might seem trivial, but we shall often meet
such operators.


It is sometimes helpful to represent operators by symbols. Just as we can talk
about a function f(x), so we can talk about an operator O. It is then useful to have
a notation that indicates that O is an operator. The convention used throughout
quantum mechanics is to place a hat above the symbol for the operator. So,
instead of O, we write Ô. You are expected to do this in your written work. In
print, operators are also set in roman (i.e. erect, rather than sloping) type. This is a
detail you may notice, but should not try to reproduce in handwriting.

(Margin note: the hat is more properly called a caret, but physicists generally
pronounce the symbol Ô as ‘O-hat’.)


Notation for operators 


When an operator is denoted by a symbol, the symbol must always wear a
hat. We write: Ô, Ĥ, p̂ₓ, etc.


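Worked Example 2.2 (Essential skill: acting with operators on functions)

Consider the operators Ô₁, Ô₂ and Ô₃:

$$ \hat{O}_1 = \frac{d}{dx}, \qquad \hat{O}_2 = 3\frac{d}{dx} + 3x^2, \qquad \hat{O}_3 = \frac{d^2}{dx^2} + 5, $$

and the function f(x) = 4x². Find the new functions obtained by operating
on f(x) with each of these operators.

Solution

Applying each of the operators in turn:

$$ \hat{O}_1 f(x) = \frac{d}{dx}\left(4x^2\right) = 8x, $$

$$ \hat{O}_2 f(x) = \left(3\frac{d}{dx} + 3x^2\right)\left(4x^2\right) = 24x + 12x^4, $$

$$ \hat{O}_3 f(x) = \left(\frac{d^2}{dx^2} + 5\right)\left(4x^2\right) = 8 + 20x^2. $$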
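The idea that an operator turns one function into another maps naturally onto functions that take a function and return a function. The sketch below treats the three operators of Worked Example 2.2 (taken here to be d/dx, 3 d/dx + 3x² and d²/dx² + 5) in this way, with derivatives estimated by finite differences. The names `op1`, `op2`, `op3` and the step size are illustrative choices only.

```python
def derivative(f, h=1e-3):
    """Return a function that approximates df/dx by a central difference."""
    return lambda x: (f(x + h) - f(x - h)) / (2.0 * h)


def second_derivative(f, h=1e-3):
    """Return a function that approximates d2f/dx2 by a central difference."""
    return lambda x: (f(x + h) - 2.0 * f(x) + f(x - h)) / h**2


def op1(f):
    """d/dx acting on f."""
    return derivative(f)


def op2(f):
    """(3 d/dx + 3x^2) acting on f."""
    df = derivative(f)
    return lambda x: 3.0 * df(x) + 3.0 * x**2 * f(x)


def op3(f):
    """(d2/dx2 + 5) acting on f."""
    d2f = second_derivative(f)
    return lambda x: d2f(x) + 5.0 * f(x)


def f(x):
    return 4.0 * x**2


x = 1.5
# Each pair should agree closely: numerical result, then the worked answer.
print(op1(f)(x), 8 * x)
print(op2(f)(x), 24 * x + 12 * x**4)
print(op3(f)(x), 8 + 20 * x**2)
```

Each operator returns a new function, which can then be evaluated at any x and compared with the algebraic answers in the solution.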


Exercise 2.1 The operators Ô₁, Ô₂ and Ô₃ are defined by:

$$ \hat{O}_1 = \frac{\partial}{\partial x}; \qquad \hat{O}_2 = 3\frac{\partial}{\partial x} + 3x^2; \qquad \hat{O}_3 = \frac{\partial^2}{\partial x^2} + 5. $$

Find the new functions obtained by acting with each of these operators on
(a) g(x, t) = 3x²t²; (b) h(x, t) = α sin(kx − ωt).


2.2.2 Linear operators 


Nearly all the operators you will meet in the context of quantum mechanics have
the special property of being linear.

(Margin note for Section 2.2.3: you may also have met eigenvalue equations in a
different form: Av = λv, where A is a square matrix and v is a vector, i.e. a column
matrix. In this case, λ is said to be an eigenvalue of the matrix A, and v is an
eigenvector.)

Linear operators

An operator Ô is said to be linear if it satisfies the equation

$$ \hat{O}(\alpha f_1 + \beta f_2) = \alpha(\hat{O} f_1) + \beta(\hat{O} f_2) \tag{2.8} $$

for all functions f₁ and f₂ and all complex constants α and β. (Complex, of
course, includes real.)


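A good example is provided by the differential operator d/dx. We can
demonstrate that this is a linear operator by using the rules of differentiation since

$$ \frac{d}{dx}\bigl(\alpha f_1(x) + \beta f_2(x)\bigr) = \frac{d}{dx}\bigl(\alpha f_1(x)\bigr) + \frac{d}{dx}\bigl(\beta f_2(x)\bigr) = \alpha\frac{df_1}{dx} + \beta\frac{df_2}{dx} $$

for all functions f₁(x) and f₂(x) and all complex constants α and β.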


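However, not all operators are linear. For example, the operator defined by

$$ \hat{O} f(x) = e^{f(x)} $$

is not linear because the left-hand side of Equation 2.8 gives

$$ \hat{O}\bigl(\alpha f(x) + \beta g(x)\bigr) = e^{\alpha f(x) + \beta g(x)} = e^{\alpha f(x)} \times e^{\beta g(x)}, $$

while the right-hand side gives an entirely different result:

$$ \alpha\bigl(\hat{O} f(x)\bigr) + \beta\bigl(\hat{O} g(x)\bigr) = \alpha e^{f(x)} + \beta e^{g(x)}. $$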
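The linearity condition of Equation 2.8 can be probed numerically by comparing its two sides at a sample point. The sketch below does this for d/dx and for the exponential operator just discussed. The helper name `linearity_defect`, the test functions and the constants are assumptions chosen for illustration, and a zero defect at a few points is only evidence of linearity, not a proof.

```python
import cmath
import math


def derivative(f, h=1e-5):
    """The operator d/dx, approximated by a central difference."""
    return lambda x: (f(x + h) - f(x - h)) / (2.0 * h)


def exponentiate(f):
    """The non-linear operator that turns f(x) into exp(f(x))."""
    return lambda x: cmath.exp(f(x))


def linearity_defect(op, f1, f2, alpha, beta, x):
    """|O(alpha f1 + beta f2) - (alpha O f1 + beta O f2)| evaluated at x."""
    def combo(s):
        return alpha * f1(s) + beta * f2(s)
    lhs = op(combo)(x)
    rhs = alpha * op(f1)(x) + beta * op(f2)(x)
    return abs(lhs - rhs)


def square(x):
    return x**2


# Constants may be complex in general; real values are used here for simplicity.
defect_derivative = linearity_defect(derivative, math.sin, square, 2.0, 3.0, 0.7)
defect_exponential = linearity_defect(exponentiate, math.sin, square, 2.0, 3.0, 0.7)
print(defect_derivative)   # close to zero, limited by rounding
print(defect_exponential)  # clearly non-zero
```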


Exercise 2.2 Which of the following operators is linear?

(a) Q̂, defined by Q̂f(x) = log f(x).

(b) R̂, defined by R̂f(x) = d²f/dx².

(c) Ŝ, defined by Ŝf(x) = [f(x)]².


2.2.3 Eigenvalue equations 


The next mathematical concept we need is that of an eigenvalue equation.
Suppose that we let an operator Â act on a function f(x). This will produce a new
function, g(x) = Âf(x), which in general is not closely related to the original
function, f(x). However, it may be possible to find a particular function, f₁(x),
with the property that

$$ \hat{A} f_1(x) = \lambda_1 f_1(x), $$

where λ₁ is a constant. The effect of the operator Â on f₁(x) is then especially
simple; it just returns the original function, multiplied by a constant. In this case,
the function f₁(x) is said to be an eigenfunction of the operator Â, and the
constant λ₁ is said to be the eigenvalue corresponding to the eigenfunction f₁(x).


Generally, an operator may have several different eigenfunctions, each of which 
has a corresponding eigenvalue. The following box summarizes the situation. 


Eigenvalue equations, eigenfunctions and eigenvalues

Given an operator Â, the eigenvalue equation for that operator is

$$ \hat{A} f(x) = \lambda f(x), \tag{2.9} $$

where λ is a complex constant.

The eigenvalue equation is generally satisfied by a particular set of functions
f₁(x), f₂(x), …, and a corresponding set of constants λ₁, λ₂, …. These are
the eigenfunctions and the corresponding eigenvalues of the operator Â.


Eigenvalue equations occur throughout physics and applied mathematics. For
example, the modes of vibration of strings and membranes are determined by
eigenvalue equations. Usually, neither the eigenfunctions nor the eigenvalues
are known at the outset, and the mathematical challenge is to find them. In
some contexts, the search for eigenfunctions is limited to a particular class of
physically-sensible functions, for example functions that are continuous, or that
do not diverge at infinity; when restrictions like this are applied, they will be made
explicit.
Worked Example 2.3 (Essential skill: checking whether a given function is an eigenfunction)

Confirm that sin(qx) is an eigenfunction of d²/dx² and determine its
eigenvalue. Is sin(qx) an eigenfunction of d/dx?

Solution

We let the operators act on the given function, and see what emerges.

$$ \frac{d^2}{dx^2}\bigl(\sin(qx)\bigr) = \frac{d}{dx}\bigl(q\cos(qx)\bigr) = -q^2 \sin(qx), $$

so sin(qx) is an eigenfunction of d²/dx² with eigenvalue −q². However,

$$ \frac{d}{dx}\bigl(\sin(qx)\bigr) = q\cos(qx), $$

which is not a constant multiple of sin(qx), so sin(qx) is not an eigenfunction of d/dx.
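The same check can be carried out numerically: if f is an eigenfunction of an operator, the ratio (Âf)(x)/f(x) takes the same value at every x where f(x) ≠ 0. The sketch below applies this test to sin(qx). The value of q, the sample points and the helper name `ratios` are arbitrary illustrative choices.

```python
import math


def derivative(f, h=1e-3):
    """The operator d/dx, approximated by a central difference."""
    return lambda x: (f(x + h) - f(x - h)) / (2.0 * h)


def second_derivative(f, h=1e-3):
    """The operator d2/dx2, approximated by a central difference."""
    return lambda x: (f(x + h) - 2.0 * f(x) + f(x - h)) / h**2


def ratios(op, f, points):
    """Values of (O f)(x) / f(x) at the given points (f must be non-zero there)."""
    return [op(f)(x) / f(x) for x in points]


q = 3.0


def f(x):
    return math.sin(q * x)


points = [0.1, 0.2, 0.4]
second_ratios = ratios(second_derivative, f, points)
first_ratios = ratios(derivative, f, points)
print(second_ratios)  # all close to -q**2, so the eigenvalue is -q**2
print(first_ratios)   # values differ from point to point: not an eigenfunction
```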


Exercise 2.3 Show that e^{iαx} is an eigenfunction of d/dx and d²/dx² and
determine the corresponding eigenvalues.


2.3 The path to Schrödinger’s equation

With the mathematical preliminaries out of the way, we now return to the task of
writing down Schrödinger’s equation for a particle that is subject to external
forces.

2.3.1 Observables and linear operators

Although we are aiming to generalize de Broglie waves, it is helpful to consider
them once more, using the mathematical language of eigenfunctions. Omitting the
dB subscript for simplicity, the de Broglie wave function is Ψ(x, t) = A e^{i(kx − ωt)}.
It is easy to see that this is an eigenfunction of both ∂/∂x and ∂²/∂x². Partially
differentiating once, and then again, gives results similar to those you found in
Exercise 2.3:

$$ \frac{\partial}{\partial x}\Psi(x,t) = ik\,\Psi(x,t), \tag{2.10} $$

$$ \frac{\partial^2}{\partial x^2}\Psi(x,t) = (ik)^2\,\Psi(x,t) = -k^2\,\Psi(x,t). \tag{2.11} $$


The eigenvalues, ik and −k², do not seem to have much physical significance, but
we can scale both sides of the above equations by constants to obtain something
more striking. Multiplying Equation 2.10 by −iħ and Equation 2.11 by −ħ²/2m,
we obtain

$$ -i\hbar\frac{\partial}{\partial x}\Psi(x,t) = \hbar k\,\Psi(x,t), \tag{2.12} $$

$$ -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2}\Psi(x,t) = \frac{(\hbar k)^2}{2m}\,\Psi(x,t). \tag{2.13} $$

The eigenvalues ħk and (ħk)²/2m are now highly significant. Using the de
Broglie relation p = ħk, and the fact that E = p²/2m, we see that they are the
momentum and kinetic energy associated with the de Broglie wave. Put another
way, the de Broglie wave function Ψ(x, t) = A e^{i(kx − ωt)} is an eigenfunction of
both −iħ ∂/∂x and −(ħ²/2m) ∂²/∂x², and the corresponding eigenvalues are
the momentum and kinetic energy associated with this wave. This could be a
curiosity, but it turns out to be far more than that.
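These two eigenvalues can be recovered numerically by letting finite-difference versions of −iħ ∂/∂x and −(ħ²/2m) ∂²/∂x² act on a de Broglie wave and dividing by the wave itself. The units (ħ = m = 1), the wave number and the function names are assumptions made for this sketch.

```python
import cmath

HBAR = 1.0  # assumed units
MASS = 1.0


def de_broglie(k, t=0.0):
    """Return the function x -> exp(i(kx - wt)) with hbar*w = (hbar*k)**2/(2m)."""
    omega = HBAR * k**2 / (2.0 * MASS)
    return lambda x: cmath.exp(1j * (k * x - omega * t))


def momentum_op(psi, h=1e-4):
    """-i*hbar d/dx acting on psi, using a central difference."""
    return lambda x: -1j * HBAR * (psi(x + h) - psi(x - h)) / (2.0 * h)


def kinetic_op(psi, h=1e-3):
    """-(hbar^2/2m) d2/dx2 acting on psi, using a central difference."""
    return lambda x: -(HBAR**2 / (2.0 * MASS)) * (
        psi(x + h) - 2.0 * psi(x) + psi(x - h)) / h**2


k = 1.5
psi = de_broglie(k)
x = 0.8
momentum = momentum_op(psi)(x) / psi(x)  # expected to be close to hbar*k
kinetic = kinetic_op(psi)(x) / psi(x)    # expected to be close to (hbar*k)**2/(2m)
print(momentum, HBAR * k)
print(kinetic, (HBAR * k)**2 / (2.0 * MASS))
```

The ratios come out (to numerical accuracy) real and independent of x, as expected for eigenvalues.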


One new term is needed before we state a fundamental quantum-mechanical 
principle. Quantum mechanics is unusual in dealing with concepts like the wave 
function that cannot be measured directly. It is therefore worth having a term that 
describes all the other types of quantity — position, momentum, energy, and so on 
— that can be measured. Such measurable quantities are called observables in 
quantum mechanics. We can now state a principle that pervades the whole of 
quantum mechanics: 


Operators, eigenvalues, observables and measurement outcomes 


Each observable O is associated with a linear operator Ô.
As a general rule, the eigenvalues of the operator Ô are the only possible
outcomes of a measurement of the observable O.


Applying this principle to Equations 2.12 and 2.13, we see that the
quantum-mechanical operators corresponding to momentum and kinetic energy
are −iħ ∂/∂x and −(ħ²/2m) ∂²/∂x². Using the convention of placing hats on
operators, the transition from the momentum pₓ to the momentum operator p̂ₓ,
and from the kinetic energy E_kin to the kinetic energy operator Ê_kin, can be
expressed as follows:

$$ p_x \Longrightarrow \hat{p}_x = -i\hbar\frac{\partial}{\partial x}, \tag{2.14} $$

$$ E_{\mathrm{kin}} \Longrightarrow \hat{E}_{\mathrm{kin}} = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2}. \tag{2.15} $$

(Margin note: the symbol ⟹ is used to indicate the transition from a classical
variable to a quantum-mechanical operator.)


A subscript x has been placed on the momentum observable pₓ and its associated
operator p̂ₓ to remind us that the momentum we are talking about is that in the
direction of increasing x. The inclusion of a subscript will help avoid problems
later, when we deal with particles moving in the negative x-direction or with
particles moving in three dimensions.

Two other important observables are represented by very simple operators. The
position coordinate x is associated with a position operator x̂, which simply tells
us to multiply by the variable x, so

$$ \hat{x}\,\Psi(x,t) = x\,\Psi(x,t). \tag{2.16} $$


Also, any function f(x) of the position coordinate is an observable, and the 
corresponding operator simply tells us to multiply by that function of x. In 
particular, we will be concerned with the potential energy function, V (x). The 
corresponding potential energy operator tells us to multiply by the potential 
energy function. Thus, 


$$ \hat{V}(x)\,\Psi(x,t) = V(x)\,\Psi(x,t). \tag{2.17} $$


Note that neither Equation 2.16 nor Equation 2.17 is an eigenvalue equation since 
x and V(x) are both functions of x and not constants. 


Exercise 2.4 A certain free particle is described by the wave function
Ψ(x, t) = A e^{i(−kx − ωt)}. This describes a plane wave propagating along the
negative x-axis. What is the result of operating with p̂ₓ on this wave function?
How do you interpret your answer?


2.3.2 Guessing the form of Schrödinger’s equation

The idea that observables are represented by operators provides the key to
extending Schrödinger’s equation beyond the uneventful world of free particles.
Using the kinetic energy operator Ê_kin = −(ħ²/2m) ∂²/∂x² introduced in the last
section, Schrödinger’s equation for a free particle moving in one dimension takes
the form

$$ i\hbar\frac{\partial \Psi(x,t)}{\partial t} = \hat{E}_{\mathrm{kin}}\,\Psi(x,t). \tag{2.18} $$

How do you think this equation might generalize to a particle that is not free?


A particle that is not free has both kinetic and potential energy. It is therefore
reasonable to suppose that the appropriate generalization is

$$ i\hbar\frac{\partial \Psi(x,t)}{\partial t} = \bigl(\hat{E}_{\mathrm{kin}} + \hat{V}\bigr)\,\Psi(x,t). $$

Using the explicit form of the kinetic energy operator, and remembering that the
potential energy operator simply tells us to multiply by the potential energy
function, we obtain

$$ i\hbar\frac{\partial \Psi(x,t)}{\partial t} = -\frac{\hbar^2}{2m}\frac{\partial^2 \Psi(x,t)}{\partial x^2} + V(x)\,\Psi(x,t). \tag{2.19} $$


This is Schrödinger’s equation for a particle of mass m moving in one
dimension, with potential energy function V(x). We have used a fair amount of
guesswork and intuition to motivate it, but revolutionary science always involves a
leap of the imagination. Here, we have taken the Schrödinger equation for a free
particle and included the effects of interactions by adding the potential energy
function in the simplest possible way. The left-hand side of the equation has been
left unchanged, and still involves ∂Ψ/∂t, the first-order partial derivative of Ψ
with respect to time.


2.3.3 A systematic recipe for Schrödinger’s equation

We have guessed the form of Schrödinger’s equation for a single particle moving
in one dimension. However, a more general procedure is needed. We now present
a three-step recipe that is guaranteed to produce Schrödinger’s equation for any
system with a well-defined classical description. The recipe applies to systems in
three dimensions and to systems that contain many particles but, to begin with, we
will continue to discuss a single particle in one dimension. The recipe is based on
the ideas that led to Equation 2.19. The first step is to give a suitable classical
description of the system.

Step 1: Specify the Hamiltonian function


In classical physics, the motion of a single particle in one dimension (along the
x-axis) is governed by Newton’s second law:

$$ m a_x = F_x, \tag{2.20} $$

which relates the acceleration aₓ of a particle of mass m to the force Fₓ acting on
it. In quantum mechanics, however, the key concepts are not acceleration and
force, but momentum and potential energy. It is possible to rewrite Newton’s
second law in terms of these concepts.


Firstly, we note that the force is related to the gradient of the potential energy
function, V(x). For example, a particle moving along the x-axis with a potential
energy function V(x) experiences the force Fₓ = −∂V/∂x. The significance of
this relationship between force and potential energy is illustrated in Figure 2.2.
Secondly, we can write maₓ = d(mvₓ)/dt = dpₓ/dt. So the x-component of
Newton’s second law becomes

$$ \frac{dp_x}{dt} = -\frac{\partial V}{\partial x}. \tag{2.21} $$
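The relationship Fₓ = −∂V/∂x is easy to explore numerically. The sketch below uses a hypothetical potential energy well, V(x) = −exp(−x²), chosen only because it has a minimum at x = 0 and tends to zero far from the origin; it is not the function plotted in Figure 2.2.

```python
import math


def potential(x):
    """Hypothetical well: minimum at x = 0, tending to zero far from the origin."""
    return -math.exp(-x**2)


def force(x, h=1e-5):
    """F_x = -dV/dx, estimated with a central difference."""
    return -(potential(x + h) - potential(x - h)) / (2.0 * h)


for x in (-1.0, 0.0, 1.0):
    # The force is positive for x < 0, zero at x = 0 and negative for x > 0,
    # so it always pushes the particle back towards the origin.
    print(x, force(x))
```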


Figure 2.2 (a) An example of a potential energy function V(x) in one dimension.
This potential energy function has a minimum at x = 0 and tends to zero far from
the origin. A classical particle with total energy E < 0 would be trapped by this
potential energy well, and oscillate to and fro about the origin. (b) The
corresponding force, Fₓ. The magnitude of the force is greatest where the potential
energy function has the steepest slope, and is equal to zero where the potential
energy function has zero slope (i.e. at x = 0). The force Fₓ is negative for x > 0
and positive for x < 0, and so tends to pull the particle back towards the origin.

The Irish mathematician William Hamilton (Figure 2.3) took these ideas one step
further. He introduced a quantity now called the Hamiltonian function, H. This
is the total energy of the system, but with the kinetic energy expressed in terms of
the momenta of the particles. For a particle of mass m moving along the x-axis,
subject to a potential energy V(x), the Hamiltonian function is

$$ H = \frac{p_x^2}{2m} + V(x). \tag{2.22} $$

Because the kinetic energy term pₓ²/2m does not depend explicitly on x, we have
∂H/∂x = ∂V/∂x, so Equation 2.21 can be written as

$$ \frac{dp_x}{dt} = -\frac{\partial H}{\partial x}, \tag{2.23} $$

which is one of Hamilton’s equations of classical mechanics. Each classical
system is characterized by its own Hamiltonian function, and it is the form of this
function (rather than the value) that really matters, as this is what determines how
the system develops in time. It is the Hamiltonian function that provides the
bridge to quantum mechanics. Writing down the Hamiltonian function for the
system of interest is the first step in our recipe for constructing Schrödinger’s
equation.

Figure 2.3 Sir William Rowan Hamilton (1805-1865), who reformulated Newtonian
mechanics to simplify the calculation of planetary orbits. Hamilton’s form of
mechanics was later found to be highly suitable for making the transition to
quantum mechanics.

Exercise 2.5 Write down the Hamiltonian function for a free particle of mass
m, confined to the x-axis.

Exercise 2.6 A particle of mass m, confined to the x-axis, is subject to
Hooke’s law, i.e. it experiences a force that is directed towards the origin and is
proportional to the particle’s displacement from the origin, so Fₓ = −Cx, where
C is called the force constant. Verify that this force is consistent with the potential
energy function V(x) = ½Cx², and write down the Hamiltonian function for the
system.

(Margin note: the force Fₓ = −Cx is exactly what is needed for simple harmonic
motion in one dimension. Hence V(x) = ½Cx² is the potential energy function for a
one-dimensional harmonic oscillator.)
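To see how the form of a Hamiltonian function determines the classical motion, the sketch below steps a harmonic oscillator forward in time using Equation 2.23 with V(x) = ½Cx², together with the companion Hamilton equation dx/dt = ∂H/∂pₓ = pₓ/m. That second equation is a standard result not stated in the text, and the step size, parameter values and function names are illustrative assumptions.

```python
def hamiltonian(x, p, mass, force_constant):
    """H = p^2/2m + (1/2) C x^2: total energy expressed in terms of momentum."""
    return p**2 / (2.0 * mass) + 0.5 * force_constant * x**2


def evolve(x, p, mass, force_constant, dt, steps):
    """Step (x, p) forward in time using Hamilton's equations."""
    for _ in range(steps):
        p -= force_constant * x * dt  # dp/dt = -dH/dx = -C x  (Equation 2.23)
        x += (p / mass) * dt          # dx/dt = dH/dp = p/m    (assumed companion equation)
    return x, p


mass = 1.0
force_constant = 4.0  # angular frequency sqrt(C/m) = 2, so the period is pi
x0, p0 = 1.0, 0.0

# 31416 steps of 1e-4 cover very nearly one full period.
x1, p1 = evolve(x0, p0, mass, force_constant, dt=1e-4, steps=31416)
print(x1, p1)  # close to the starting values after one period
print(hamiltonian(x0, p0, mass, force_constant),
      hamiltonian(x1, p1, mass, force_constant))  # energy is very nearly unchanged
```

After one period the particle has returned almost exactly to its starting position and momentum, and the value of H is essentially unchanged.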
(Margin note for Exercise 2.7: use the answer to Exercise 2.6.)

Step 2: From function to operator


The second step in our recipe is to replace the Hamiltonian function by the
corresponding Hamiltonian operator. Incidentally, since the Hamiltonian
function describes an observable quantity (the total energy of the classical
system), the Hamiltonian operator will be the energy operator for the system.
Some rules for converting observables into operators were given in Section 2.3.1.
For example, we made the correspondence

$$ p_x \Longrightarrow \hat{p}_x = -i\hbar\frac{\partial}{\partial x}, $$

from which it follows that the kinetic energy operator is

$$ \frac{\hat{p}_x^2}{2m} = \frac{1}{2m}\left(-i\hbar\frac{\partial}{\partial x}\right)^2 = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2}, \tag{2.24} $$

in agreement with Equation 2.15. Applying these rules, the Hamiltonian operator
takes the form

$$ \hat{H} = \frac{\hat{p}_x^2}{2m} + \hat{V}(x) = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + V(x). \tag{2.25} $$


Exercise 2.7 Write down the Hamiltonian operator for a particle of mass m,
confined to the x-axis, and subject to Hooke’s law with force Fₓ = −Cx.


Step 3: Assembling Schrödinger’s equation

The final step is to assemble Schrödinger’s equation, which always takes the same
general form

$$ i\hbar\frac{\partial \Psi}{\partial t} = \hat{H}\,\Psi, $$

where Ĥ is the Hamiltonian operator. Of course, this operator depends on the
system under study. For a particle of mass m moving along the x-axis, Ĥ is given
by Equation 2.25 and Schrödinger’s equation becomes

$$ i\hbar\frac{\partial \Psi(x,t)}{\partial t} = -\frac{\hbar^2}{2m}\frac{\partial^2 \Psi(x,t)}{\partial x^2} + V(x)\,\Psi(x,t), $$

which agrees with Equation 2.19.
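The Hamiltonian operator of Equation 2.25 can be applied to a sampled function in the same finite-difference spirit as before. The sketch below uses the Hooke’s-law potential energy V(x) = ½Cx² and a Gaussian trial function. It is a standard result, not derived in this section, that this particular Gaussian is an eigenfunction of Ĥ with eigenvalue ½ħ√(C/m); the units (ħ = m = C = 1) and the function names are assumptions.

```python
import math

HBAR = 1.0            # assumed units
MASS = 1.0
FORCE_CONSTANT = 1.0


def potential(x):
    """Hooke's-law potential energy V(x) = (1/2) C x^2."""
    return 0.5 * FORCE_CONSTANT * x**2


def apply_hamiltonian(psi, v, h=1e-3):
    """Return the function H psi = -(hbar^2/2m) psi'' + V psi (Equation 2.25)."""
    def h_psi(x):
        d2 = (psi(x + h) - 2.0 * psi(x) + psi(x - h)) / h**2
        return -(HBAR**2 / (2.0 * MASS)) * d2 + v(x) * psi(x)
    return h_psi


def trial(x):
    """Gaussian trial function exp(-m w x^2 / (2 hbar)) with w = sqrt(C/m)."""
    omega = math.sqrt(FORCE_CONSTANT / MASS)
    return math.exp(-MASS * omega * x**2 / (2.0 * HBAR))


h_trial = apply_hamiltonian(trial, potential)
energy_ratios = [h_trial(x) / trial(x) for x in (0.0, 0.3, 1.2)]
print(energy_ratios)  # the same value at every x, namely hbar*w/2 in these units
```

Because the ratio Ĥψ/ψ is the same at every point, acting with Ĥ on this function simply multiplies it by a constant energy.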


Exercise 2.8 Write down Schrödinger’s equation for a particle of mass m
confined to the x-axis for the following cases:

(a) a free particle,

(b) a particle subject to Hooke’s law with force constant C.


Our recipe for constructing Schrödinger’s equation is very general. So far,
we have applied it to a single particle in one dimension, but it also works for
many-particle systems in three dimensions, including nuclei, atoms, molecules
and metals. As the system becomes more complicated, so does its Hamiltonian
function, and hence its Schrödinger equation, but the same principles apply.

To illustrate the point, consider a system of two particles of masses m₁ and m₂,
interacting with one another and moving in three dimensions. The electron and
proton in a hydrogen atom would be a good example. The Hamiltonian function
of such a system takes the form

$$ H = \frac{p_{1x}^2 + p_{1y}^2 + p_{1z}^2}{2m_1} + \frac{p_{2x}^2 + p_{2y}^2 + p_{2z}^2}{2m_2} + V(r), \tag{2.26} $$

where p₁ₓ is the x-component of the momentum of particle 1, and so on. The
potential energy function V(r) expresses the interaction between the particles and
is assumed to depend only on the distance r between them. Now, we can make the
transition to quantum mechanics by replacing each momentum component by the
appropriate linear operator. For example, we replace p₁ₓ by p̂₁ₓ = −iħ ∂/∂x₁,
where x₁ is the x-coordinate of particle 1. This leads to the Hamiltonian operator

$$ \hat{H} = -\frac{\hbar^2}{2m_1}\left(\frac{\partial^2}{\partial x_1^2} + \frac{\partial^2}{\partial y_1^2} + \frac{\partial^2}{\partial z_1^2}\right) - \frac{\hbar^2}{2m_2}\left(\frac{\partial^2}{\partial x_2^2} + \frac{\partial^2}{\partial y_2^2} + \frac{\partial^2}{\partial z_2^2}\right) + V(r). $$

The Schrödinger equation for the two-particle system is obtained by substituting
this operator into

$$ i\hbar\frac{\partial \Psi}{\partial t} = \hat{H}\,\Psi. $$

In this case, the wave function is a function of all the coordinates of both particles,
and also of time: Ψ = Ψ(x₁, y₁, z₁, x₂, y₂, z₂, t). This book will avoid such
complications, but you will see how they are dealt with later in the course, when
we discuss hydrogen, helium and more complicated atoms.


2.4 Wave functions and their interpretation 


Now that we have a recipe for writing down Schrödinger’s equation, you might
suppose that the next step would be to solve this equation in simple cases.
However, the solution of Schrödinger’s equation is a wave function, Ψ(x, t),
which has a much less obvious physical significance than water waves or sound
waves, for example. We therefore begin by discussing the interpretation of the
wave function.

The first point to make is that the wave function is complex. Given the factor of
i = √(−1) that explicitly appears in Schrödinger’s equation, and the fact that
potential energies, masses and so on, are real, a purely real function Ψ, with
Im(Ψ) = 0, cannot satisfy Schrödinger’s equation.


The complex nature of the wave function is a nuisance when it comes to drawing
graphs. To give a complete picture of the wave function, we need to show both the
real and imaginary parts. However, for some purposes, this would be too much
information. As we noted in Chapter 1, the wave function itself cannot be
measured. It is the square of the modulus of the wave function that tells us where
a particle is likely to be found. This is the content of Born’s rule, proposed in
1926 by Max Born (Figure 2.4).

Figure 2.4 Max Born (1882-1970) was the first to see that the wave function must
be interpreted in terms of probability. He was awarded the 1954 Nobel prize for
physics.

2.4.1 Born’s rule and normalization


You met Born’s rule in Chapter 1, so we give only a brief reminder here. For our
purposes, it is sufficient to consider the version that applies to a single particle in
one dimension. In this case, Born’s rule states that the probability of finding the
particle within a small interval δx centred on the point x is

$$ \mathrm{probability} = |\Psi(x,t)|^2\,\delta x. \tag{2.27} $$


The quantity |Ψ(x, t)|² is called the probability density; in one dimension this
represents the probability per unit length of finding the particle around a given
point. In general, there will be a region where |Ψ(x, t)|² is appreciably non-zero,
and this is the region where the particle is likely to be found. However, the spatial 
extent of this region should not be confused with the size of the particle whose 
behaviour the wave function describes. For example, the wave function describing 
an electron in a hydrogen atom is appreciable over the volume occupied by the 
atom, whereas the electron itself has no discernible size. 


Equation 2.27 presumes that the interval δx is small enough for Ψ(x, t) to be
effectively constant over that interval. If this is not the case, the probability of
finding the particle between x = a and x = b is given by the integral:

$$ \mathrm{probability} = \int_{x=a}^{x=b} |\Psi(x,t)|^2\,dx. $$

Of course, we know that, if the position of the particle is measured, it must be
found somewhere. Since certainty is characterized by a probability of 1, we
require that

$$ \int_{-\infty}^{\infty} |\Psi(x,t)|^2\,dx = 1. \tag{2.28} $$

This is called the normalization condition, and wave functions that satisfy it are
said to be normalized.


If we have a technique for solving Schrödinger’s equation, we have no reason to
expect that the solution that emerges will be normalized. Unless we are very
lucky, it will not be normalized. Normalization is something that is imposed after
solving Schrödinger’s equation in order to ensure that the wave function is
consistent with Born’s rule.


Suppose that we are given a function Φ(x, t) that satisfies Schrödinger’s equation,
but is not normalized. How should we proceed? The first thing to note is that if
Φ(x, t) is a solution of Schrödinger’s equation then so is AΦ(x, t), where A is any
complex constant. To obtain a normalized wave function Ψ(x, t), we write
Ψ(x, t) = AΦ(x, t) and then impose normalization by requiring that

$$ 1 = \int_{-\infty}^{\infty} |\Psi(x,t)|^2\,dx = \int_{-\infty}^{\infty} |A\,\Phi(x,t)|^2\,dx = |A|^2 \int_{-\infty}^{\infty} |\Phi(x,t)|^2\,dx. $$

The wave function Ψ(x, t) will then be normalized provided that we choose

$$ |A| = \left(\int_{-\infty}^{\infty} |\Phi(x,t)|^2\,dx\right)^{-1/2}. \tag{2.29} $$

Any complex number A that satisfies this condition will achieve normalization
but, for simplicity, we generally choose A to be real and positive. The constant A
is called the normalization constant.
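Equation 2.29 and the probability integral above lend themselves to a short numerical sketch. The unnormalized function used here, a Gaussian multiplied by a complex exponential, is a hypothetical example considered at a single fixed time, and the finite limits ±10 stand in for ±∞ on the assumption that the function is negligible beyond them.

```python
import cmath
import math


def integrate(f, a, b, n=20000):
    """Trapezoidal-rule estimate of the integral of f from a to b."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for j in range(1, n):
        total += f(a + j * h)
    return total * h


def phi(x):
    """Hypothetical unnormalized wave function at a fixed time."""
    return math.exp(-x**2 / 2.0) * cmath.exp(2j * x)


def normalization_constant(wave, a=-10.0, b=10.0):
    """Real, positive A satisfying Equation 2.29 (limits approximate infinity)."""
    norm_squared = integrate(lambda x: abs(wave(x))**2, a, b)
    return norm_squared ** -0.5


A = normalization_constant(phi)


def psi(x):
    """Normalized wave function Psi = A * Phi."""
    return A * phi(x)


def probability(wave, a, b):
    """Born's rule: probability of finding the particle between a and b."""
    return integrate(lambda x: abs(wave(x))**2, a, b)


print(A)                             # normalization constant
print(probability(psi, -10.0, 10.0)) # total probability, which should be 1
print(probability(psi, -1.0, 1.0))   # probability of finding the particle in [-1, 1]
```

Note that the complex exponential factor has no effect on any of these numbers, because only |Φ|² enters the integrals.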


Although Schrödinger’s equation does not guarantee that wave functions are
normalized, it does have an important property that underpins Born’s rule. If a
wave function Ψ(x, t₀) is normalized at a particular instant t₀, and we use
Schrödinger’s equation to find Ψ(x, t) at any other time t, then it can be shown
that the wave function Ψ(x, t) remains normalized. This will be proved much
later in the Course.


2.4.2 Wave functions and states 


The wave function can be combined with Born’s rule to find out where a particle 
is most likely to be found. But there is much more to physics than this, and the 
wave function has a much wider role to play. That wider role concerns the state of 
a system. 


In classical physics the state of a system can be completely specified by giving the 
values of a set of measurable quantities. For example, in the case of a free particle 
of mass m, it would be sufficient to specify the position and velocity of the 
particle at some particular time. Thanks to Newton’s laws, this would determine 
the position and velocity of the particle at any other time. 


In quantum physics the situation is very different. The idea that a system may 
exist in a variety of states is still of crucial importance, but the state of a system 
cannot generally be specified by listing the values of measurable quantities, such 
as position and velocity. Due to quantum indeterminism, even a free particle does
not possess precise values of position and velocity at any particular time. In wave
mechanics, the state of a quantum system at a given time is specified by a wave
function at that time. In fact, according to wave mechanics:


The wave function describing a particular state of a system provides the 
most complete description that it is possible to have of that state. 


We emphasize that the wave function contains all the information we can possibly 
have about the state of a quantum system. For example, in the case of a free 
particle, it contains all the information we can have about its energy and 
momentum. (Later chapters will explain how this information is extracted in more 
complicated cases.) However, that information is generally probabilistic; it 
provides only the possible outcomes of measurements and the probability of each 
of those outcomes. This is a deep fact about the way the world works, rather than 
a failure of the wave function to specify the state of the system in a more precise 
way: there is an intrinsic and unavoidable indeterminism in the quantum world. 


As emphasized in Chapter 1, the indeterminism of quantum physics limits the
scope of predictions about the future, but it does not prevent all predictions. Given
the wave function that describes the state of a quantum system at some particular
time, Schrödinger’s equation determines how that wave function changes with
time and hence allows us to obtain probabilistic information about the system at
later times. This will always be the case as long as the system remains
undisturbed. However, when a measurement is made, and out of all possible
outcomes one alone becomes actual, the state suddenly changes and the wave
function undergoes a correspondingly abrupt change, which is called a ‘collapse’.
This collapse of the wave function is not described by Schrödinger’s equation.

Finally, it is important to note that the wave function that describes a state is not
unique. If a wave function is multiplied by a complex number of the form e^{iα},
where α is real, the state described by this modified wave function is unchanged,
and all the predictions about the possible outcomes of measurements and their
probabilities are unchanged. For example, since |e^{iα}| = 1, it is clear that

$$ |e^{i\alpha}\,\Psi(x,t)|^2 = |e^{i\alpha}|^2\,|\Psi(x,t)|^2 = |\Psi(x,t)|^2, $$

so Born’s rule and the normalization of the wave function are unaffected.
Complex constants of the form e^{iα}, where α is real, are called phase factors.
When we describe the state of a system by a wave function, the choice of the
phase factor multiplying the whole wave function is arbitrary. However, it is
important not to misunderstand this point: if a wave function is the sum of two
parts, the relative phase of the parts does matter, since it determines whether the
parts will interfere constructively or destructively. You saw this effect in the
two-slit interference of electrons described in Chapter 1.
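The distinction between an overall phase factor and a relative phase can be checked with a few lines of arithmetic. In the sketch below, `part1` and `part2` stand for the values of two parts of a wave function at a single point; these values and the function name are hypothetical.

```python
import cmath
import math


def density_of_sum(part1, part2, global_phase=0.0, relative_phase=0.0):
    """|e^{i*global} (part1 + e^{i*relative} part2)|^2 at a single point."""
    total = part1 + cmath.exp(1j * relative_phase) * part2
    return abs(cmath.exp(1j * global_phase) * total) ** 2


# Two parts of equal amplitude at the chosen point.
print(density_of_sum(1.0, 1.0))                          # constructive interference
print(density_of_sum(1.0, 1.0, global_phase=1.3))        # unchanged by an overall phase
print(density_of_sum(1.0, 1.0, relative_phase=math.pi))  # destructive interference
```

An overall phase leaves the probability density untouched, whereas changing the relative phase by π turns constructive interference into destructive interference.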


2.4.3 The superposition principle 


Wave functions have another important property: they obey the following 
principle. 


The superposition principle 


If Ψ₁ and Ψ₂ are solutions of Schrödinger’s equation, then so is the linear
combination aΨ₁ + bΨ₂, where a and b are arbitrary complex numbers.


Recalling the definition of a linear operator (Equation 2.8), and writing
Schrödinger’s equation in the compact form

$$ i\hbar\frac{\partial}{\partial t}\Psi = \hat{H}\,\Psi, $$

we see that the superposition principle is valid provided that iħ ∂/∂t and Ĥ are
both linear operators. It should come as no surprise that they are, since partial
differentiation is certainly linear and all observables in quantum mechanics
(including the Hamiltonian) are represented by linear operators. You can give a
more explicit proof in the following exercise.


Exercise 2.9 Verify that aΨ₁(x, t) + bΨ₂(x, t) is a solution of Schrödinger’s
equation (Equation 2.19), provided that Ψ₁(x, t) and Ψ₂(x, t) are both solutions
of Schrödinger’s equation.

Exercise 2.10 Is it true to say that any linear combination of normalized wave
functions represents a state of a system?


The superposition principle is important for many reasons. First, it underlies 
interference effects, like the two-slit interference of electrons described in 
Chapter 1. You may recall that this was explained by representing the wave 
function as the sum of two parts, one associated with passage through slit 1, and 
the other associated with passage through slit 2. The justification for such a wave
function is provided by the superposition principle: each part of the wave function
obeys Schrödinger’s equation, and so does their sum.


Using the superposition principle, we can combine wave functions in countless 
ways. For example, we can form linear combinations of different de Broglie 
waves. The de Broglie waves themselves extend over the whole of space and are 
therefore idealizations. But some linear combinations of de Broglie waves only 
extend over a finite region, and provide much more realistic descriptions of real 
particles. 
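The contrast between a single de Broglie wave and a localized combination can be illustrated with a short calculation. This is a sketch under stated assumptions: the waves are evaluated at t = 0, the weights are Gaussian in the wave number, and the values of `K0`, `SIGMA` and `N_WAVES` are arbitrary illustrative choices. A Gaussian-weighted sum is strongly localized rather than strictly confined to a finite region, and the result is not normalized.

```python
import cmath
import math

K0 = 5.0       # central wave number (illustrative value)
SIGMA = 0.5    # spread of wave numbers (illustrative value)
N_WAVES = 161  # number of de Broglie waves in the sum


def packet(x):
    """Sum de Broglie waves exp(i*k*x) at t = 0 with Gaussian weights."""
    total = 0j
    dk = 8 * SIGMA / (N_WAVES - 1)  # wave numbers span K0 +/- 4*SIGMA
    for j in range(N_WAVES):
        k = K0 - 4 * SIGMA + j * dk
        weight = math.exp(-((k - K0) ** 2) / (2 * SIGMA**2))
        total += weight * cmath.exp(1j * k * x) * dk
    return total


positions = (0.0, 20.0)
single_wave_density = [abs(cmath.exp(1j * K0 * x)) ** 2 for x in positions]
packet_density = [abs(packet(x)) ** 2 for x in positions]

print(single_wave_density)  # the same at both positions
print(packet_density)       # far larger near x = 0 than at x = 20
```

The single wave gives the same value of $|\Psi|^2$ everywhere, whereas the combination is concentrated around x = 0.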


The superposition principle also leads to some of the deepest puzzles of quantum 
physics. If a state described by the wave function $\Psi_1$ has some property (say, 
energy $E_1$) and a state described by the wave function $\Psi_2$ has another property 
(say, energy $E_2$), what can be said about the state described by $\Psi_1 + \Psi_2$, suitably 
normalized? The remarkable answer is that, in some sense, this state has both 
energy $E_1$ and energy $E_2$; its energy is uncertain. Schrödinger never felt 
comfortable with this idea, and dramatized the situation by dreaming up wave 
functions in which a cat is both dead and alive. As you can imagine, the course 
will have more to say about this later. You will see that the superposition principle 
underpins quantum teleportation and quantum computers. 


2.5 The time-independent Schrödinger equation 


You saw in Chapter 1 that some quantum systems have discrete energy levels, and 
that transitions between those levels produce characteristic patterns of spectral 
lines. We now turn to the second major topic of this chapter, which is to explain in 
general terms where these energy levels come from, and to show how their values 
can be calculated. 


In wave mechanics, the state of a system is described by a wave function, and 
that wave function must satisfy Schrödinger’s equation for the system. In this 
section, for the sake of simplicity, we shall mainly restrict our attention to a 
one-dimensional system consisting of a particle of mass m subject to a potential 
energy function V(x). Schrödinger’s equation then takes the form of a partial 
differential equation in two variables, x and t: 


\[ i\hbar \frac{\partial \Psi(x, t)}{\partial t} = -\frac{\hbar^2}{2m} \frac{\partial^2 \Psi(x, t)}{\partial x^2} + V(x)\,\Psi(x, t). \tag{2.30} \]


We now focus on solutions that describe states of definite energy. It turns out that 
these solutions can be found by a standard approach to tackling partial differential 
equations — the method of separation of variables. This method is described in 
the next subsection. 


2.5.1 The separation of variables 


The key step in the method of separation of variables is to assume that the wave 
function $\Psi(x, t)$ can be written as a product of a function of x and a function of t: 


\[ \Psi(x, t) = \psi(x)\,T(t). \tag{2.31} \]


The symbol $\psi$ on the right is the lower-case Greek letter psi. Make sure you can 
distinguish it from the upper-case psi ($\Psi$) on the left. 


There is a mathematical motive behind this assumption: it leads to greatly 
simplified equations. The simplification is achieved at the expense of only finding 
some of the solutions to Schrödinger’s equation, but this does not matter for our 
purposes since all the energy levels can be accounted for by product wave 
functions of the form given in Equation 2.31. 


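When we insert the product wave function into Schrödinger’s equation we get: 


\[ i\hbar \frac{\partial}{\partial t}\bigl[\psi(x)T(t)\bigr] = \left[ -\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2} + V(x) \right] \psi(x)T(t). \tag{2.32} \]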


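Since $\partial/\partial t$ does not affect $\psi(x)$ and $\partial^2/\partial x^2$ does not affect T(t), this may be 
rewritten as 


\[ \psi(x)\, i\hbar \frac{\partial T(t)}{\partial t} = T(t) \left[ -\frac{\hbar^2}{2m} \frac{\partial^2 \psi(x)}{\partial x^2} + V(x)\,\psi(x) \right]. \]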


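The functions T(t) and $\psi(x)$ both depend on a single variable, so we can replace 
the partial derivatives by ordinary derivatives. Doing this, and dividing both sides 
by $\psi(x)T(t)$, we obtain 


\[ \frac{1}{T(t)}\, i\hbar \frac{dT(t)}{dt} = \frac{1}{\psi(x)} \left[ -\frac{\hbar^2}{2m} \frac{d^2\psi(x)}{dx^2} + V(x)\,\psi(x) \right]. \]

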
Now, the left-hand side of this equation is independent of x and the right-hand 
side is independent of t. How can two sides of an equation be guaranteed to be 
equal when each depends on a different independent variable? The answer is 
simple: each side of the equation must be equal to the same constant. We shall 
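denote this constant by E, which is not an arbitrary choice of symbol, as you will 
soon see. Thus, 


\[ \frac{1}{T(t)}\, i\hbar \frac{dT(t)}{dt} = \frac{1}{\psi(x)} \left[ -\frac{\hbar^2}{2m} \frac{d^2\psi(x)}{dx^2} + V(x)\,\psi(x) \right] = E. \]


This gives us two ordinary differential equations: 


\[ i\hbar \frac{dT(t)}{dt} = E\,T(t), \tag{2.33} \]

\[ -\frac{\hbar^2}{2m} \frac{d^2\psi(x)}{dx^2} + V(x)\,\psi(x) = E\,\psi(x). \tag{2.34} \]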


The first equation involves only the variable t and the second involves only the 
variable x, so the two variables have been separated from one another. The 
constant E that appears in both equations is called a separation constant. 


The method of separation of variables is useful because ordinary differential 
equations are usually much easier to solve than partial differential equations. 
Equation 2.33 has a particularly simple solution: 


\[ T(t) = e^{-iEt/\hbar}. \tag{2.35} \]


Exercise 2.11 Verify that $T(t) = e^{-iEt/\hbar}$ is a solution of Equation 2.33. 
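Alongside the analytic verification asked for in the exercise, a quick numerical sanity check is possible. The sketch below assumes units in which $\hbar = 1$ and an arbitrary illustrative value of the separation constant `E`. The derivative is estimated by a central difference, so the two sides of Equation 2.33 agree only to within a small discretization error.

```python
import cmath

HBAR = 1.0  # assumed units: hbar = 1
E = 2.0     # illustrative value of the separation constant


def T(t):
    """The time-dependent factor exp(-i E t / hbar)."""
    return cmath.exp(-1j * E * t / HBAR)


def lhs(t, h=1e-5):
    """Left-hand side of Equation 2.33, i*hbar dT/dt, by central difference."""
    return 1j * HBAR * (T(t + h) - T(t - h)) / (2 * h)


t = 0.37  # an arbitrary instant
mismatch = abs(lhs(t) - E * T(t))  # difference between the two sides
modulus = abs(T(t))                # the factor has modulus 1 at all times

print(mismatch, modulus)
```

The modulus of T(t) stays equal to 1, a fact that becomes important when stationary states are discussed.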


The solution for T(t) is universal in the sense that it applies to all systems, 
irrespective of the potential energy function V(x). However, the second 
differential equation 


\[ -\frac{\hbar^2}{2m} \frac{d^2\psi(x)}{dx^2} + V(x)\,\psi(x) = E\,\psi(x) \tag{Eqn 2.34} \]


is characteristic of the system under study. This equation will be of major 
importance in this course. It is called the time-independent Schrödinger 
equation for the given system. 


If we know the potential energy function V(x), we can look for a function $\psi(x)$ 
that satisfies the time-independent Schrödinger equation for a given value of E. If 
we can find such a function, we can put everything back together and write down 
the product wave function 


\[ \Psi(x, t) = \psi(x)\,T(t) = \psi(x)\,e^{-iEt/\hbar}. \tag{2.36} \]


By virtue of its construction, starting from Equation 2.32, this wave function 
satisfies Schrödinger’s equation. The wave function involves the separation 
constant E, and we now turn to consider the significance of this quantity. 


2.5.2 An eigenvalue equation for energy 


In order to interpret the separation constant, E, we write the time-independent 
Schrödinger equation in the form 


\[ \left[ -\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2} + V(x) \right] \psi(x) = E\,\psi(x), \tag{2.37} \]


where we have reverted to using partial derivatives. This makes no difference 
because the function $\psi(x)$ only depends on a single variable. However, it helps us 
to recognize the term in square brackets as the Hamiltonian operator for the 
system. It follows that the time-independent Schrödinger equation can be written 
in the compact form 


\[ \hat{H}\psi(x) = E\,\psi(x). \tag{2.38} \]


This is an eigenvalue equation of the kind introduced in Section 2.2.3. In this 
case, the operator concerned is the Hamiltonian operator, which is the operator 
that corresponds to the total energy of the system, so: 


The time-independent Schrödinger equation of a system is the energy 
eigenvalue equation of that system. 


In general, the eigenvalues of a quantum-mechanical operator $\hat{O}$ are the possible 
measured values of the corresponding observable, O. Hence, the separation 
constant, E, can be interpreted as the energy of the system. The wave function in 
Equation 2.36 is associated with a particular value of E, so it is natural to assume 
that: 


The product wave function $\Psi(x, t) = \psi(x)\,e^{-iEt/\hbar}$, where $\psi(x)$ is an energy 
eigenfunction with eigenvalue E, describes a state of definite energy E. If 
the energy of the system is measured in this state, the value E will certainly 
be obtained. 


For reasons that will be explained later, the states of definite energy that are 
described by product wave functions are called stationary states. 
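The statement that the time-independent Schrödinger equation is an eigenvalue equation can be made concrete with a small numerical illustration. The sketch below is not a system taken from the text: it assumes a constant potential energy `V0`, a trial function $\sin(kx)$, and units in which $\hbar = m = 1$. Applying the Hamiltonian operator (with the second derivative estimated by finite differences) and dividing by the function gives the same number at every point tested, which is what it means for the function to be an eigenfunction.

```python
import math

HBAR = 1.0  # assumed units: hbar = 1
MASS = 1.0  # assumed units: m = 1
V0 = 0.5    # constant potential energy (illustrative value)
K = 3.0     # wave number of the trial function (illustrative value)


def psi(x):
    """Trial function sin(kx)."""
    return math.sin(K * x)


def hamiltonian_on(f, x, h=1e-3):
    """Apply -(hbar^2/2m) d2/dx2 + V0 to f at the point x."""
    d2f = (f(x + h) - 2 * f(x) + f(x - h)) / h**2
    return -HBAR**2 / (2 * MASS) * d2f + V0 * f(x)


points = [0.2, 0.7, 1.3]  # arbitrary points where psi is non-zero
ratios = [hamiltonian_on(psi, x) / psi(x) for x in points]
expected = HBAR**2 * K**2 / (2 * MASS) + V0  # eigenvalue found analytically

print(ratios, expected)
```

For a function that is not an eigenfunction, the ratio would vary from point to point and no single value of E could be assigned.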


2.5.3 Energy eigenvalues and eigenfunctions 


The time-independent Schrödinger equation is the eigenvalue equation for 
energy. When eigenvalue equations arise in specific physical contexts, it is 
sometimes agreed to restrict the search for eigenfunctions to a certain class 
of ‘physically-sensible function’. This is the case with the time-independent 
Schrödinger equation. 


We insist that the energy eigenfunctions $\psi(x)$ should not diverge as 
$x \to \pm\infty$. 


It is easy to see that eigenfunctions that diverge as $x \to \pm\infty$ are never going to 
be of any physical interest. If we were to try to use such eigenfunctions in 
product wave functions (stationary states) we would find that these stationary 
states, and their linear combinations, would also diverge at infinity. Born’s rule 
(Equation 2.27) forbids such solutions since it requires that the wave function be 
normalized (Equation 2.28). 


In the next chapter, you will see how the time-independent Schrödinger equation 
is solved in specific cases. However, it is important to have an overview of the 
type of result that emerges. A typical potential energy function is shown in 
Figure 2.5. 


Figure 2.5 A finite well. The potential energy function, V(x), has a local 
minimum, $-V_0$, is finite for all values of x, and remains finite as x 
approaches $\pm\infty$. 


This potential energy function has a minimum and then levels off, approaching 
flatness at infinity. It could describe the potential energy that arises when one 
particle is attracted to another fixed particle. Potential energy functions of this 
type are called finite wells. Finite wells may be drawn in many ways. In our 
example we have chosen to identify the top of the well with zero energy and the 
bottom with energy $-V_0$, which you can see is negative. 


We shall concentrate, to begin with, on energies below the top of the well (that is, 
energies that are negative in Figure 2.5). If we try to solve the time-independent 
Schrödinger equation in this case, bearing in mind that the solutions must not 
diverge at infinity, something quite remarkable happens. Physically acceptable 
solutions are only found for certain specific values of E. These are the energy 
eigenvalues below the top of the well, and they form a discrete set; that is to say, 
they can be labelled $E_1, E_2, E_3, \ldots$. Corresponding to each energy eigenvalue, 
there is an energy eigenfunction, so the eigenfunctions also form a discrete set, 
labelled $\psi_1(x), \psi_2(x), \psi_3(x), \ldots$. Equation 2.37 for the eigenfunctions becomes 


\[ \left[ -\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2} + V(x) \right] \psi_n(x) = E_n\,\psi_n(x), \tag{2.39} \]


where the label n (which is called a quantum number) is useful in picking out a 
particular eigenvalue and eigenfunction. 


Now, the energy eigenvalues give us the possible energies of the system. So the 
discrete set of energy eigenvalues implies a discrete set of energy levels, two of 
which are shown in Figure 2.6. In other words, quantization of energy arises 
directly from the time-independent Schrödinger equation and the conditions that 
identify its physically acceptable solutions. Transitions between the energy levels 
then account for the characteristic pattern of spectral lines observed for the 
system. An example is shown in Figure 2.7. 
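The emergence of a discrete set of energy levels below the top of a finite well can be seen in a rough numerical experiment. The sketch below is an illustration under several assumptions that are not part of the text: a square-shaped finite well of depth `V0` and half-width `HALF_WIDTH`, units in which $\hbar = m = 1$, a finite-difference approximation to the second derivative, and a wave function forced to vanish at the ends of a large interval as a stand-in for the requirement that it must not diverge at infinity. It uses the third-party NumPy library.

```python
import numpy as np

HBAR = 1.0        # assumed units: hbar = 1
MASS = 1.0        # assumed units: m = 1
V0 = 10.0         # well depth (illustrative value)
HALF_WIDTH = 1.0  # the well occupies |x| <= HALF_WIDTH (illustrative value)

# Grid on a large interval; psi is taken to be zero at the two end points.
x = np.linspace(-10.0, 10.0, 801)
dx = x[1] - x[0]
interior = x[1:-1]

# Top of the well at zero energy, bottom at -V0, as in Figure 2.5.
potential = np.where(np.abs(interior) <= HALF_WIDTH, -V0, 0.0)

# d2psi/dx2 is replaced by (psi[j+1] - 2 psi[j] + psi[j-1]) / dx^2,
# which turns the Hamiltonian operator into a symmetric tridiagonal matrix.
kinetic_scale = HBAR**2 / (2 * MASS * dx**2)
n = interior.size
hamiltonian = (
    np.diag(2 * kinetic_scale + potential)
    - kinetic_scale * np.eye(n, k=1)
    - kinetic_scale * np.eye(n, k=-1)
)

energies = np.linalg.eigvalsh(hamiltonian)  # eigenvalues in ascending order
bound_levels = energies[energies < 0]       # levels below the top of the well

print(bound_levels)
```

For these particular parameter values only a handful of eigenvalues lie between the bottom and the top of the well, and they are well separated from one another. The positive eigenvalues produced by this calculation are closely spaced only because the interval is finite; they are a crude image of the continuum discussed later in this section.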


Figure 2.6 For a given finite well, the energy eigenvalues between the top and 
bottom of the well are restricted to discrete values, the energy levels of the 
system, two of which are shown. (Energy levels for real nuclei, atoms and 
molecules were given in Figure 1.3.) 


Figure 2.7 The optical spectrum of molecular nitrogen 
(N$_2$). Each line is due to photons having an energy that is the 
difference between two energy eigenvalues of the Hamiltonian 
operator for N$_2$. 


2.5.4 Stationary states and wave packets 


Corresponding to the energy eigenvalue $E_n$ and the energy eigenfunction $\psi_n(x)$, 
we have the stationary-state wave function 


\[ \Psi_n(x, t) = \psi_n(x)\,e^{-iE_n t/\hbar}. \tag{2.40} \]


If the system is in this state, and the energy is measured, the value $E_n$ is certain to 
be obtained. This is a state of definite energy. We can now explain why these 
particular solutions, $\Psi_n(x, t)$, of Schrödinger’s equation are said to describe 
stationary states. 


Remember Born’s rule, which tells us that the probability of finding the particle in 
a small interval of length $\delta x$, centred on x, is, for any $\Psi_n$, 


\[ \text{probability} = |\Psi_n(x, t)|^2\,\delta x = \Psi_n^*(x, t)\,\Psi_n(x, t)\,\delta x. \tag{2.41} \]


Let us see what this implies for the stationary-state wave function of 
Equation 2.40. Substituting this wave function into Born’s rule gives 


\[ |\Psi_n(x, t)|^2\,\delta x = \left[ \psi_n(x)\,e^{-iE_n t/\hbar} \right]^* \psi_n(x)\,e^{-iE_n t/\hbar}\,\delta x. \]


Now, we know that $E_n$ is an energy, so it follows that it is a real quantity, equal to 
its own complex conjugate ($E_n^* = E_n$). Using this fact, we see that 


\[ |\Psi_n(x, t)|^2\,\delta x = \psi_n^*(x)\,\psi_n(x)\,e^{+iE_n t/\hbar}\,e^{-iE_n t/\hbar}\,\delta x. \]


Noting that the product of the two exponentials is unity, we conclude that 


\[ |\Psi_n(x, t)|^2\,\delta x = |\psi_n(x)|^2\,\delta x. \tag{2.42} \]


It follows that for the state described by $\Psi_n(x, t)$ the probability of finding the 
particle in any small interval of length $\delta x$ is independent of time. Although we 
shall not prove it yet, anything that we can measure is also independent of time in 
this state: that is why it is described as being stationary. 


It is not quite true to think of the state as frozen; there is a time-dependent phase 
factor, $e^{-iE_n t/\hbar}$, so the real and imaginary parts of the wave function change with 
time, even though the modulus of the wave function does not. All parts of the 
wave oscillate in phase with one another, rather like a plucked guitar string, but 
with the added complication that the wave function is complex. 
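The difference between the wave function, which changes with time, and its squared modulus, which does not, can be illustrated numerically. In the sketch below the spatial factor `psi_n` is a hypothetical stand-in (a Gaussian) and the value of `E_N` is arbitrary; neither is claimed to be an eigenfunction or eigenvalue of any particular system, since the check concerns only the phase factor. Units with $\hbar = 1$ are assumed.

```python
import cmath
import math

HBAR = 1.0  # assumed units: hbar = 1
E_N = 1.5   # illustrative energy eigenvalue


def psi_n(x):
    """Hypothetical spatial factor; any real function of x would serve here."""
    return math.exp(-x**2 / 2)


def Psi_n(x, t):
    """Product wave function psi_n(x) * exp(-i E_n t / hbar)."""
    return psi_n(x) * cmath.exp(-1j * E_N * t / HBAR)


x = 0.8
times = [0.0, 0.5, 1.0, 2.0]

densities = [abs(Psi_n(x, t)) ** 2 for t in times]  # |Psi_n|^2 at each time
real_parts = [Psi_n(x, t).real for t in times]      # Re(Psi_n) at each time

print(densities)   # the same at every time
print(real_parts)  # oscillates as the phase factor rotates
```

The probability density is the same at every instant, while the real part of the wave function oscillates.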


Stationary states correspond to nothing that is familiar in the world of 
human-sized objects. Before the introduction of wave mechanics it might have 
been thought that the lines seen in atomic spectra had something to do with 
electrons moving in closed orbits around atomic nuclei and occasionally jumping 
from one orbit to another. Such a picture is inconsistent with wave mechanics. In 
the stationary states that wave mechanics uses to explain the energy levels of 
atoms, nothing can really be said to ‘move’ at all. The probability of detecting the 
electron in any given region never changes. 


Of course, there are objects that do move in nature and quantum physics must 
have a way of describing them. It is clear that stationary states cannot do this. 
However, it must be remembered that stationary states are a special type of 
solution to Schrödinger’s equation. There are other types of solution, not 
described by product wave functions, that can describe motion. You will see in 
Chapter 6 how these non-stationary states are constructed. The key idea will turn 
out to be the superposition principle. By adding together different stationary-state 
wave functions in an appropriate way, we can produce a normalized wave function 
which is not itself a stationary state, and can describe motion. Such linear 
combinations of stationary states are called wave packets. 
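To see why a combination of stationary states is not itself stationary, it may help to work out the probability density in the simplest case, a sum of just two stationary-state wave functions with energies $E_1$ and $E_2$. The following is a sketch rather than a result quoted from the text; the coefficients $a$ and $b$ are arbitrary complex numbers and normalization is left aside.

```latex
\begin{align*}
\Psi(x,t) &= a\,\psi_1(x)\,e^{-iE_1 t/\hbar} + b\,\psi_2(x)\,e^{-iE_2 t/\hbar}, \\
|\Psi(x,t)|^2 &= |a|^2\,|\psi_1(x)|^2 + |b|^2\,|\psi_2(x)|^2
  + 2\,\mathrm{Re}\!\left[ a^* b\,\psi_1^*(x)\,\psi_2(x)\,e^{-i(E_2 - E_1)t/\hbar} \right].
\end{align*}
```

The first two terms are independent of time, just as in Equation 2.42. The cross term, however, oscillates with angular frequency $(E_2 - E_1)/\hbar$ whenever $E_1 \neq E_2$, so the probability of finding the particle in a given small interval can change with time.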


The concept of a wave packet will also help us to get around another difficulty. So 
far, we have said nothing about the energies above the top of the finite well in 
Figure 2.5. This was quite deliberate because the situation here is more subtle. 
For any value of E above the top of the well, it is possible to find an energy 
eigenfunction $\psi(x)$ that satisfies the time-independent Schrödinger equation, and 
does not diverge as $x \to \pm\infty$. Thus, in addition to the discrete energy levels 
below the top of the well there is also a continuum of levels above the top of the 
well (see Figure 2.8). However, it turns out that eigenfunctions above the top of 
the well do not approach zero as $x \to \pm\infty$, so neither they, nor the corresponding 
stationary-state wave functions $\Psi(x, t)$, can be normalized. Born’s rule tells us 
that these wave functions are physically unacceptable and cannot really describe 
states. But we know that there are states that correspond to the continuum above 
the top of the well — atoms after all can be ionized into states of positive energy. 
How are these states to be described? It turns out that certain linear combinations 
of positive-energy stationary-state wave functions can be normalized, and it is 
these wave packets that are used to describe systems in the continuum. We cannot 
do full justice to this point here, but more extensive discussions will be given in 
Chapter 6. 


Figure 2.8 For a given finite well, there are energy eigenvalues for all values of 
energy above the top of the well. These are the continuum energy levels of the 
system, corresponding to ionized states of atoms. 


2.6 Schrödinger’s equation, an overview 


Schrödinger’s equation, 


\[ i\hbar \frac{\partial \Psi}{\partial t} = \hat{H}\Psi, \tag{2.43} \]


is the equation that governs the time-development of the wave function $\Psi$ that 
describes the state of a system for which $\hat{H}$ is the Hamiltonian operator. If, at a 
particular time $t_0$, we are given the wave function $\Psi(x, t_0)$, then we can calculate 
$\Psi(x, t)$ for all later times, $t > t_0$. To this extent the world is deterministic: 
the way the world is now determines the way the world will be in the future. 
However, the world is also indeterministic; the wave function provides the most 
complete description we can have of the state of a system, yet it is limited to 
providing probabilistic information about that state. It may predict the outcome of 
a particular measurement with probability 1, i.e. with certainty, as in the case of 
the energy corresponding to a stationary state, but there will inevitably be other 
aspects of that state that are indeterminate. Schrödinger’s equation allows us to 
predict with great accuracy the probability of an electron appearing in a specified 
region of a screen or the probability of a nucleus decaying in the next second, but 
it will not allow us to predict exactly where a particular electron will be seen or 
exactly when a particular nucleus will decay. 


This element of uncertainty in no way detracts from the great power of 
Schrödinger’s equation. It is the means to understand much (in principle, just 
about all) about atoms, molecules, nuclei, solids, liquid helium, white dwarf stars, 
neutron stars, the production of the elements.... The problem is how to solve 
Schrödinger’s equation for complex systems. In the following chapters we shall 
present solutions for some simple systems in order to illustrate general principles 
that also apply in more complex cases. 


As you work through these chapters you will be introduced to all of the basic 
principles of wave mechanics. However, as you start that journey you may find it 
useful to keep in mind the following preliminary version of these principles: 


Preliminary principles of wave mechanics 


1. The state of a system at time t is represented by a wave function $\Psi(x, t)$. 


2. An observable, such as energy or momentum, is represented by a linear 
operator, such as $-i\hbar\,\partial/\partial x$ for the momentum component $p_x$. 


3. As a general rule, the only possible outcomes of a measurement of 
an observable are the eigenvalues of the associated operator. 


4. The time-evolution of a system in a given state is governed by Schrödinger’s 
equation. 


5. A measurement will cause the collapse of the wave function: a sudden 
and abrupt change that is not described by Schrödinger’s equation. 


Summary of Chapter 2 


Section 1 A key task is to establish the equation that governs the 
time-development of the wave function. This is Schrödinger’s equation. It is 
possible to guess the form of Schrödinger’s equation for a de Broglie wave 
describing a free particle of energy $\hbar\omega$ and momentum $\hbar k$. 


Section 2 To extend Schrödinger’s equation to particles subject to forces, 
several mathematical concepts are needed, including operators (specifically linear 
operators) and eigenvalue equations, which take the form $\hat{A}f(x) = \lambda f(x)$. 


Section 3 In wave mechanics each observable is associated with a linear 
operator, and the only possible outcomes of a measurement of an observable are 
the eigenvalues of the corresponding operator. For example, the momentum 
component $p_x$ is represented by the linear operator $\hat{p}_x = -i\hbar\,\partial/\partial x$. A de Broglie 
wave is an eigenfunction of both the kinetic energy and the momentum operators. 


Schrödinger’s equation for a system can be written down using a three-step recipe. 
Step 1: write down the Hamiltonian function of the system. This is the energy of 
the system, including kinetic and potential energies, but with the kinetic energy 
written in terms of momentum. Step 2: replace all classical observables by the 
corresponding operators to obtain the Hamiltonian operator. Step 3: write down 
Schrödinger’s equation in the compact form: 


\[ i\hbar \frac{\partial \Psi}{\partial t} = \hat{H}\Psi, \]


and expand the right-hand side by applying the appropriate Hamiltonian operator 
to a wave function that depends on the coordinates of all the particles, and on time. 


Section 4 The wave function $\Psi(x, t)$, a solution of Schrödinger’s equation, is 
complex and cannot be measured. It provides the most complete specification 
possible of the state of a system. The probability of finding the particle in that 
state in a small interval $\delta x$, centred on x, is $|\Psi(x, t)|^2\,\delta x$ (Born’s rule), provided 
that the wave function has been normalized. Wave functions which differ by an 
overall multiplicative phase factor describe the same state. Since Schrödinger’s 
equation involves only linear operators, a linear combination of two solutions is 
also a solution. This is the superposition principle. 


Section 5 Schrödinger’s equation for a particle moving in one dimension 
with potential energy V(x) is satisfied by product wave functions of the form 
$\Psi(x, t) = \psi(x)\,T(t)$. The method of separation of variables shows that T(t) has 
the universal form $e^{-iEt/\hbar}$, while $\psi(x)$ satisfies the time-independent Schrödinger 
equation 


\[ \left[ -\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2} + V(x) \right] \psi(x) = E\,\psi(x). \]


If the energy of the system is measured in a state described by the product wave 
function $\psi(x)\,e^{-iEt/\hbar}$, the value E will be observed with certainty. 


The time-independent Schrödinger equation is an eigenvalue equation for energy: 


\[ \hat{H}\psi(x) = E\,\psi(x). \]


Insisting that the energy eigenfunctions $\psi(x)$ should not diverge as x approaches 
$\pm\infty$ ensures that, if V(x) is a finite well, the energy eigenvalues between the 
bottom and the top of the well are discrete while those above the top of the well 
form a continuum. The product wave functions are usually called stationary-state 
wave functions because they lead to probability distributions that are independent 
of time. Those corresponding to the continuum are not physically acceptable since 
they cannot be normalized. However, some linear combinations of stationary-state 
wave functions, known as wave packets, can be normalized and can be used to 
describe the non-stationary states associated with the continuum. 


Achievements from Chapter 2 


After studying this chapter, you should be able to: 


2.1 Explain the meanings of the newly defined (emboldened) terms and 
symbols, and use them appropriately. 


2.2 Recognize the action of simple operators; identify those that are linear; 
determine whether given functions are eigenfunctions and if so determine 
the corresponding eigenvalues. 


2.3 Write an expression for the operator representing momentum, $\hat{p}_x$. 


2.4 Give an account of the relationship between operators, their eigenvalues and 
the possible outcomes of measurements. 


2.5 Write down the Hamiltonian function, the corresponding Hamiltonian 
operator and Schrödinger’s equation for simple systems of particles 
interacting through potential energy functions. 

2.6 Write a brief account of the significance of the wave function $\Psi(x, t)$ 
including its interpretation in terms of position measurements and 
probabilities; explain the requirement for $\Psi(x, t)$ to be normalized. 


2.7 Explain why the linearity of $\hat{H}$ and the superposition principle as applied to 
Schrödinger’s equation are relevant to understanding two-slit interference. 


2.8 Use the method of separation of variables to show that stationary-state wave 
functions have the universal form $\Psi(x, t) = \psi(x)\,e^{-iEt/\hbar}$, where $\psi(x)$ 
satisfies the time-independent Schrödinger equation. Describe the general 
properties of stationary states. 


2.9 For a one-dimensional system consisting of a particle in a finite well, 
describe the way in which the time-independent Schrödinger equation 
accounts for the discrete and continuum energy levels. 


2.10 State a preliminary version of the principles of wave mechanics. 


2.11 Recognize that multiplying $\Psi(x, t)$ by a phase factor $e^{i\alpha}$, where $\alpha$ is real, 
makes no difference to any probabilities extracted from $\Psi(x, t)$. 


Chapter 3 Particles in boxes 


Introduction 


In Chapter 2, we introduced Schrödinger’s equation and stressed its importance in 
wave mechanics, but did not solve it for any specific system. This chapter will set 
up and solve Schrödinger’s equation for a number of situations involving a 
particle in a box. By this, we mean that the particle is more or less confined to a 
limited region of space but that, within this limited region, the particle feels no 
forces. The region of space may be in one, two or three dimensions. 


Technological devices that exploit the quantum behaviour of particles in boxes are 
becoming increasingly important. Such devices are generally constructed from 
semiconducting materials, such as silicon and gallium, that have electrical 
conductivities intermediate between those of good conductors (such as copper), 
and insulators (such as glass). At the heart of many of these devices is a tiny 
structure called a quantum dot that consists of a speck of one semiconductor 
embedded in a larger sample of another semiconductor (Figure 3.1). The enclosed 
speck may be as small as one nanometre across, and contain only a few hundred 
atoms. Some of the electrons in the speck can become detached from their parent 
atoms, but are inhibited from entering the surrounding semiconductor; such 
electrons behave very much like particles in a tiny three-dimensional box. 


Figure 3.1 Three quantum 
dots of various shapes. Each 
consists of a small ‘box’ of 
one semiconductor entirely 
surrounded by another 
semiconductor. 


The trapped electrons have wave-like properties and occupy energy levels, just 
as the electrons in an atom occupy energy levels. Indeed, quantum dots are 
sometimes referred to as artificial atoms. In contrast to real atoms, however, 
quantum dots can be tailored to produce a range of properties that atoms and 
molecules fail to provide. The light-emitting properties of quantum dots are 
particularly important; quantum dots are used in solid-state lasers (for CD and 
DVD players), lighting systems, solar cells, and even as fluorescent markers for a 
range of biomedical applications (Figure 3.2). 


Figure 3.2 Quantum dots emitting light 
after exposure to ultraviolet radiation. The 
colour of the light depends on the size of the 
quantum dot, which increases from left to 
right. 


Quantum dots, in which electrons are trapped in a tiny three-dimensional box, 
are just one example of a type of structure that uses microscopic arrangements 
of semiconductors to confine electrons. Some other examples are shown in 
Figure 3.3. In the case of a quantum wire, where a thin thread of one 
semiconductor is embedded in another, the electrons are free to move along the 
thread, while being confined in the other two dimensions. These electrons can be 
thought of as being trapped in a two-dimensional box while being unconfined in 
the third dimension. In a quantum wafer, where a very thin layer of one 
semiconductor is sandwiched between broader layers of another, the electrons are 
free to move in the plane of the wafer while being confined in the dimension 
perpendicular to the plane of the wafer. These electrons can be thought of as 
being trapped in a one-dimensional box while forming a sort of freely-moving 
two-dimensional gas in the plane of the wafer. 


Figure 3.3 Microscopic structures formed by embedding one semiconductor 
inside another: (a) a quantum dot, (b) a quantum wire and (c) a quantum wafer. 


We shall not describe the detailed physics of semiconductor devices here, but will 
concentrate on the underlying quantum behaviour of particles in boxes. The focus 
will be on finding the energy levels and stationary-state wave functions in these 
model systems, and on interpreting the results. However, you should bear in mind 
that our models are closely related to real systems of increasing technological 
importance. 


In the sections that follow we shall consider boxes in one, two and three 
dimensions, though we shall often refer to them as wells, since in each case our 
starting point will be a classical potential energy function describing the ‘well’ in 
which the particle is trapped. Section 3.1 discusses the idealized case of a particle 
in a one-dimensional infinitely-deep well. Section 3.2 extends the discussion to 
infinitely-deep wells in two and three dimensions. Finally, Section 3.3 considers a 
particle in a one-dimensional well of finite depth. You will see how sets of discrete 
energy levels emerge from Schrödinger’s equation (Figure 3.4) and how particles 
can sometimes be found in regions that would be forbidden according to classical 
physics (Figure 3.5). 


Figure 3.4 Schrödinger’s equation shows that a particle confined to an infinitely 
deep well has a set of discrete energy levels. 


Figure 3.5 In quantum physics there is some chance of a particle being found in 
regions that are forbidden according to classical physics. 


3.1 The one-dimensional infinite square well 


The first ‘box’ we are going to investigate is called a one-dimensional infinite 
square well. This is the kind of box that confines the electrons of a quantum 
wafer in the dimension perpendicular to the plane of the wafer. We start by 
considering the classical behaviour of a particle trapped inside such a box and 
then move on to consider the behaviour of the analogous quantum system by 
setting up and solving the appropriate Schrödinger equation. 


3.1.1 The classical system 


The classical system that is our starting point consists of a particle of mass m, 
confined to a one-dimensional region of length L. The potential energy well 
responsible for trapping the particle has abrupt, infinitely-high walls, but between 
the walls, the particle feels no forces. Such a well is called a one-dimensional 
infinite square well. It can be represented by the potential energy function 


\[ V(x) = 0 \quad \text{for } 0 \le x \le L, \tag{3.1} \]
\[ V(x) = \infty \quad \text{for } x < 0 \text{ and } x > L, \tag{3.2} \]


which is illustrated in Figure 3.6. 


Figure 3.6 The potential energy function for a one-dimensional infinite square 
well. 
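For readers who find it helpful to see the potential energy function written as a procedure, Equations 3.1 and 3.2 can be sketched as follows. The function name `infinite_square_well` is hypothetical, and treating the walls themselves (x = 0 and x = L) as part of the region where V = 0 is an assumption about the boundary convention.

```python
import math


def infinite_square_well(x, L):
    """Potential energy V(x): zero between the walls, infinite outside.

    The end points x = 0 and x = L are assigned V = 0 (assumed convention).
    """
    if 0 <= x <= L:
        return 0.0
    return math.inf


# Sample the potential inside and outside a well of length L = 1.
samples = [infinite_square_well(x, 1.0) for x in (-0.1, 0.5, 1.5)]
print(samples)
```

The infinite value outside the walls is what makes that region inaccessible to the particle.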


This specification of the potential energy function involves some arbitrary choices; 
we have chosen to locate the walls of the well at x = 0 and x = L, and have taken 
the potential energy function to be equal to zero inside the well. We might have 
chosen differently; by locating the walls of the well at $x = -L/2$ and $x = +L/2$ 
for example, or by assigning a constant positive value $V_0$ to the potential energy 
inside the well. Such choices would alter the potential energy function of the 
system, and would therefore affect the way we describe the particle’s behaviour, 
but they wouldn’t influence the behaviour itself. We shall return to this point later. 


According to classical physics, the total energy of a particle trapped in a 
one-dimensional infinite square well can have any finite positive value. Since the 
particle’s potential energy inside the well is zero, the total energy of the particle is 
equal to its kinetic energy. A non-zero kinetic energy allows the particle to bounce 
back and forth between the impenetrable walls with constant speed. Increasing 
the energy would simply increase the speed. But no matter how great the energy, 
there is no possibility of the particle being found outside the well. The region 
outside the well is therefore said to be classically forbidden. 


Exercise 3.1 How would you modify Equations 3.1 and 3.2 to represent
a one-dimensional infinite square well with walls at $x = -L/2$ and
$x = +L/2$?


3.1.2 Setting up Schrödinger’s equation


The procedure for writing down Schrödinger’s equation was given in Chapter 2. It
consists of three steps:


1. Write down the Hamiltonian function $H(p_x, x)$, which is the classical
energy of the system expressed in terms of momentum and position.


2. Convert $H(p_x, x)$ into an operator, $\hat{H}$, using the replacement rules

$$x \Rightarrow \hat{x} = x, \qquad p_x \Rightarrow \hat{p}_x = -i\hbar\frac{\partial}{\partial x}, \qquad \text{so that} \qquad \frac{p_x^2}{2m} \Rightarrow -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2}.$$


3. Write down Schrödinger’s equation in the form

$$i\hbar\frac{\partial \Psi(x,t)}{\partial t} = \hat{H}\,\Psi(x,t).$$


For a system consisting of a particle of mass m in a one-dimensional potential 
energy well V(x), this prescription leads to a Schrödinger equation of the form 
$$i\hbar\frac{\partial \Psi(x,t)}{\partial t} = -\frac{\hbar^2}{2m}\frac{\partial^2 \Psi(x,t)}{\partial x^2} + V(x)\Psi(x,t). \tag{3.3}$$


We are interested in the stationary states of this system — states of definite 
energy. These are special solutions of Schrödinger’s equation that are products of
separate functions of x and t:

$$\Psi(x,t) = \psi(x)T(t).$$

Following exactly the same steps as in Chapter 2, we replace $\Psi(x,t)$ in
Equation 3.3 by the product $\psi(x)T(t)$ and then separate variables to obtain two
ordinary differential equations

$$i\hbar\frac{dT}{dt} = E\,T(t), \tag{3.4}$$
$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V(x)\psi(x) = E\psi(x). \tag{3.5}$$

Equation 3.4 is satisfied by a function of the form

$$T(t) = e^{-iEt/\hbar}, \tag{3.6}$$

while Equation 3.5 is the time-independent Schrödinger equation. This is the
eigenvalue equation for energy; its solutions $\psi(x)$ are the energy eigenfunctions
and the corresponding values of E are the energy eigenvalues.




In the case of the one-dimensional infinite square well, V(x) = 0 inside the box, 
so Equation 3.5 becomes 


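$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} = E\psi(x) \quad \text{for } 0 \le x \le L, \tag{3.7}$$

while outside the box, where V(x) is infinite, the only solution to Equation 3.5 is

$$\psi(x) = 0 \quad \text{for } x < 0 \text{ and } x > L. \tag{3.8}$$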


The task that now confronts us is that of solving Equation 3.7, subject to 
appropriate boundary conditions. As you will soon see, the energy eigenfunctions 
and eigenvalues that emerge form discrete sets, so we can talk about the
nth eigenfunction, $\psi_n(x)$, and the nth eigenvalue, $E_n$. The corresponding
stationary-state wave function is

$$\Psi_n(x,t) = \psi_n(x)\,e^{-iE_n t/\hbar}. \tag{3.9}$$

This describes a state of definite energy $E_n$: if you measure the energy of the
system in this state, you will certainly get the value $E_n$. So, by solving
Equation 3.7, and obtaining all the energy eigenvalues, we will find all the 
possible energy levels of the system, together with the stationary-state wave 
functions, which give a complete description of the state of the system in each of 
these energy levels. 


Exercise 3.2 Explain the distinction between an energy eigenfunction $\psi(x)$
and a wave function $\Psi(x,t)$.


3.1.3 Solving the time-independent Schrödinger equation


Solving the time-independent Schrödinger equation inside the infinite square well 
is not difficult. Equation 3.7 is a second-order differential equation, so its general 
solution contains two arbitrary constants. Moreover, the general solution $\psi(x)$
must be such that, when differentiated twice, the result is $\psi(x)$ multiplied by the
constant $(-2mE/\hbar^2)$. This immediately suggests that the solutions will be of the
form

$$\psi(x) = A\sin(kx) + B\cos(kx) \quad \text{for } 0 \le x \le L, \tag{3.10}$$


where A and B are arbitrary constants and k is a constant that depends on the 
properties of the system. 


Exercise 3.3 Show that Equation 3.10 is indeed a solution of Equation 3.7,
provided that

$$k = \frac{\sqrt{2mE}}{\hbar}. \tag{3.11}$$


Exercise 3.4 In deriving Equation 3.11, we took the positive square root of the
equation $k^2 = 2mE/\hbar^2$. It would be equally valid to take the negative square
root, to obtain $k = -\sqrt{2mE}/\hbar$. Explain why this case can be ignored without
any loss of generality.
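

As a numerical companion to Exercise 3.3 (not a substitute for the algebra), the following sketch checks by finite differences that the trial solution of Equation 3.10 satisfies Equation 3.7 when k is given by Equation 3.11. The electron mass, the 1 nm width, the energy of 1 eV, the values of A and B, and the function name `tise_residual` are all illustrative assumptions, not values taken from the text.

```python
import numpy as np

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M_E = 9.1093837015e-31   # electron mass, kg (assumed particle)

def tise_residual(E, A=1.0, B=0.5, L=1e-9, num=2001):
    """Largest mismatch between -(hbar^2/2m) psi'' and E psi, relative to max|E psi|."""
    k = np.sqrt(2.0 * M_E * E) / HBAR             # Equation 3.11
    x = np.linspace(0.0, L, num)
    dx = x[1] - x[0]
    psi = A * np.sin(k * x) + B * np.cos(k * x)   # Equation 3.10
    # Central-difference estimate of the second derivative at interior points.
    d2psi = (psi[2:] - 2.0 * psi[1:-1] + psi[:-2]) / dx**2
    lhs = -(HBAR**2 / (2.0 * M_E)) * d2psi
    rhs = E * psi[1:-1]
    return float(np.max(np.abs(lhs - rhs)) / np.max(np.abs(rhs)))

E_test = 1.0 * 1.602176634e-19   # 1 eV in joules, an arbitrary positive energy
print(tise_residual(E_test))
```

The residual is limited only by the finite-difference step, so it shrinks as `num` is increased. No boundary conditions are imposed here, which is why any positive energy passes the check.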


3.1.4 Boundary conditions for the eigenfunctions


We have found the general solution of the time-independent Schrödinger equation 
inside the well, but this is not sufficient. The eigenfunctions we are looking for 
must also satisfy some boundary conditions relating to finiteness and continuity, 
as we shall now explain. 


First, we always require that energy eigenfunctions should remain finite as
$x \to \pm\infty$. This is not a problem in the present case because we already know that
$\psi(x)$ is equal to zero outside the well (Equation 3.8).


Figure 3.7 A schematic illustration of the
continuity boundary conditions: (a) $\psi(x)$ and
$d\psi/dx$ are both continuous functions provided
that V(x) remains finite; (b) this conclusion is
unaffected by finite jumps in V(x); (c) an
infinite discontinuity in V(x) produces a finite
discontinuity in $d\psi/dx$, which implies that $\psi(x)$
has a sharp kink, but remains continuous.


More importantly, there are conditions that refer to the
continuity of the eigenfunction and its derivative. Because 
these conditions are important throughout this chapter, we shall 
introduce them in a form that applies to all one-dimensional 
situations. Then, we will specialize to the case of present 
interest — a one-dimensional infinite square well. 


The main point is that the time-independent Schrödinger 
equation must be satisfied everywhere, without any exceptions 
at individual points. If V(x) is finite, Equation 3.5 implies
that $d^2\psi/dx^2$ must also be finite. It then follows that $\psi(x)$
and $d\psi/dx$ must both be continuous (since any jump in these
functions would prevent $d^2\psi/dx^2$ from having a finite value).


The potential energies of real systems are always finite, but this 
may not be true in some model systems. The one-dimensional 
infinite square well, for example, has a potential energy function 
that becomes infinite at each wall of the well. At any point 
where the potential energy function is infinite, Equation 3.5 
implies that either $\psi(x)$ is equal to zero, or $d^2\psi/dx^2$ is
infinite (or both these things are true). Bearing in mind that
infinities only appear in artificial models, it is sensible to
take the function $\psi(x)$ to be as smooth as possible (subject
to the above considerations). We therefore assume that any
infinity in $d^2\psi/dx^2$ is associated with a finite discontinuity
in $d\psi/dx$, but that $\psi(x)$ remains continuous everywhere.
This leads to the following boundary conditions:


Continuity boundary conditions 


The eigenfunction $\psi(x)$ is always continuous. The
first derivative $d\psi/dx$ is continuous in regions where
the potential energy function is finite; it need not be 
continuous at points where the potential energy function 
becomes infinite. 


These conditions are illustrated schematically in Figure 3.7. 

It is worth noting that they depend on whether the potential 
energy function is finite, but not on whether it is continuous. 
The functions $\psi(x)$ and $d\psi/dx$ are continuous at points where
the potential energy function has a finite jump; $d\psi/dx$ can
only be discontinuous at points where V(x) becomes infinite.




The continuity boundary conditions apply at all points in space, but they become 
particularly useful at points where we need to match expressions for $\psi(x)$ in two
neighbouring regions. In the case of a one-dimensional infinite square well, this is
necessary at the points x = 0 and x = L, which mark the walls of the well. Since
the potential energy function goes to infinity at these points, we only insist that the
eigenfunction $\psi(x)$ remains continuous; there is no restriction on $d\psi/dx$. Since
$\psi(x)$ is equal to zero outside the well, the continuity boundary conditions give

$$\psi(0) = 0 \quad \text{and} \quad \psi(L) = 0. \tag{3.12}$$

In the next section we shall impose these conditions on the general solutions we
found earlier, and examine their consequences.


Technical note: continuity boundary conditions apply to energy eigenfunctions and so to
stationary-state wave functions; they do not necessarily apply to all wave functions.


3.1.5 Applying the boundary conditions: energy quantization 


Returning to the general solution $\psi(x) = A\sin(kx) + B\cos(kx)$ of the
time-independent Schrödinger equation inside the well, and applying the
boundary condition $\psi(0) = 0$, it is immediately apparent (since sin 0 = 0 and
cos 0 = 1) that

$$B = 0.$$

Then, applying the other boundary condition, $\psi(L) = 0$, it follows that

$$A\sin(kL) = 0. \tag{3.13}$$

We are not interested in cases where A = 0, since requiring that A = 0 and
B = 0 would leave us with no solution at all. So what Equation 3.13 really tells
us is that sin(kL) = 0. The only solutions to this equation are $kL = n\pi$, where n
is an integer (i.e. a whole number). We can rule out the possibility n = 0, since
this would give $\psi(x) = 0$ everywhere. We can also restrict attention to positive n,
since we have taken k to be positive (see Exercise 3.4). It therefore follows that

$$kL = n\pi \quad \text{for } n = 1, 2, 3, \ldots$$


Of course, this is actually a condition on the acceptable values of k, since L is a 
fixed parameter of the system, describing the width of the well. Thus, even though 
the time-independent Schrödinger equation has solutions corresponding to all 
positive values of k, the only ones consistent with the continuity boundary 
conditions correspond to certain discrete values of k that we can denote
$k_1, k_2, k_3, \ldots$ where

$$k_n = \frac{n\pi}{L} \quad \text{for } n = 1, 2, 3, \ldots \tag{3.14}$$

This is very significant, since it leads to restrictions on the energy eigenfunctions
and energy eigenvalues. In particular, it follows from Equation 3.10 that the
allowed energy eigenfunctions inside the well are of the form

$$\psi_n(x) = A_n\sin\left(\frac{n\pi x}{L}\right) \quad \text{for } n = 1, 2, 3, \ldots, \tag{3.15}$$

where $A_1, A_2, A_3, \ldots$ are constants. Similarly, Equation 3.11 implies that the
corresponding energy eigenvalues $E_1, E_2, E_3, \ldots$ are given by

$$E_n = \frac{n^2\pi^2\hbar^2}{2mL^2} = \frac{n^2h^2}{8mL^2} \quad \text{for } n = 1, 2, 3, \ldots \tag{3.16}$$

Recall that $\hbar = h/2\pi$.


This is a momentous result. Because the quantum number n is restricted to
integer values, it shows that the acceptable energy eigenvalues are discrete,
i.e. separate and distinct, in this case taking the values $E_1 = \pi^2\hbar^2/2mL^2$,
$E_2 = 4\pi^2\hbar^2/2mL^2$, $E_3 = 9\pi^2\hbar^2/2mL^2$, and so on. These are the possible
energy levels of the system. The lowest energy level is $E_1$. According to quantum
mechanics, no particle in a one-dimensional infinite square well can have less than 
this amount of energy. In particular, there is no state of zero energy. The fact that 
the energy levels are discrete is an example of energy quantization. 
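

To give a feeling for the sizes involved, here is a minimal sketch that evaluates the energy levels of Equation 3.16 in electronvolts. The choice of an electron in a well of width 1 nm is an assumption made purely for illustration, and `energy_level` is a hypothetical helper name.

```python
H = 6.62607015e-34       # Planck constant, J s
M_E = 9.1093837015e-31   # electron mass, kg (assumed particle)
EV = 1.602176634e-19     # joules per electronvolt

def energy_level(n, L, m=M_E):
    """Energy E_n = n^2 h^2 / (8 m L^2) of Equation 3.16, in joules."""
    if n < 1:
        raise ValueError("n must be a positive integer; there is no n = 0 state")
    return n**2 * H**2 / (8.0 * m * L**2)

L = 1e-9  # well width of 1 nm (assumed, illustrative value)
levels_eV = [energy_level(n, L) / EV for n in range(1, 5)]
for n, E in enumerate(levels_eV, start=1):
    print(f"E_{n} = {E:.3f} eV")
```

With these assumed values the lowest level comes out at roughly 0.38 eV, and each higher level is $n^2$ times larger.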


Quantization does not come from the time-independent Schrödinger equation 
itself, but rather from the imposed boundary conditions and the fact that we are 
dealing with a confined system of limited size, L. If we allowed the box to get 
larger and larger by increasing L, the difference in energy between any two 
neighbouring levels would approach zero as L approached infinity. Under these 
circumstances our particle in a box would behave increasingly like a free particle, 
which can have any positive energy. Energy quantization is an immensely 
important feature of confined quantum systems. Even so, you should not make the 
mistake of thinking that it is somehow the heart of quantum physics. The example 
of the free particle makes it clear that there are quantum systems that do not have 
discrete energy levels. So, despite its importance, the essence of quantum physics 
is not to be found in the discreteness of energy. 
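

The shrinking of the level spacing with increasing L can be illustrated numerically. The sketch below, again assuming an electron and a hypothetical helper `level_gap`, evaluates the gap between the two lowest levels for a sequence of widths; it illustrates the trend and leaves the general argument to Exercise 3.5.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M_E = 9.1093837015e-31   # electron mass, kg (assumed particle)
EV = 1.602176634e-19     # joules per electronvolt

def energy_level(n, L, m=M_E):
    """E_n = n^2 pi^2 hbar^2 / (2 m L^2), Equation 3.16, in joules."""
    return n**2 * math.pi**2 * HBAR**2 / (2.0 * m * L**2)

def level_gap(n, L, m=M_E):
    """Energy difference between neighbouring levels n + 1 and n, in joules."""
    return energy_level(n + 1, L, m) - energy_level(n, L, m)

widths = [1e-9, 1e-8, 1e-7, 1e-6]   # widths in metres (assumed values)
gaps_eV = [level_gap(1, L) / EV for L in widths]
for L, gap in zip(widths, gaps_eV):
    print(f"L = {L:.0e} m: E_2 - E_1 = {gap:.3e} eV")
```

Each tenfold increase in L reduces the gap a hundredfold, consistent with the $1/L^2$ dependence in Equation 3.16.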


Exercise 3.5 Show that the difference between any two neighbouring energy
levels in a one-dimensional infinite square well of width L does indeed approach
zero as L approaches infinity.


3.1.6 The normalization condition 


Another condition must be imposed in order to obtain sensible wave functions. 
The physical requirement that the particle must be found somewhere, combined 
with Born’s rule for the probability distribution of position, leads to the 
normalization condition for the wave function $\Psi(x,t)$ that was discussed in
Section 2.4.1. We require that

$$\int_{-\infty}^{\infty} |\Psi(x,t)|^2\,dx = 1. \tag{3.17}$$


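We are interested in stationary-state wave functions of the form
$\Psi_n(x,t) = \psi_n(x)\,e^{-iE_n t/\hbar}$. In this case,

$$|\Psi_n(x,t)|^2 = \left|\psi_n(x)\,e^{-iE_n t/\hbar}\right|^2 = |\psi_n(x)|^2\,\left|e^{-iE_n t/\hbar}\right|^2 = |\psi_n(x)|^2,$$

which leads to a normalization condition for the energy eigenfunctions $\psi_n(x)$:


Eigenfunction normalization condition

$$\int_{-\infty}^{\infty} |\psi_n(x)|^2\,dx = 1. \tag{3.18}$$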




In the case of a one-dimensional infinite square well with walls at x = 0 and 
x = L, we know that the energy eigenfunction $\psi_n(x)$ must be zero outside the
well, so the eigenfunction normalization condition of Equation 3.18 becomes

$$\int_0^L |\psi_n(x)|^2\,dx = 1. \tag{3.19}$$


Applying this condition to the eigenfunctions of Equation 3.15 we obtain 


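$$\int_0^L |\psi_n(x)|^2\,dx = \int_0^L \left|A_n\sin\left(\frac{n\pi x}{L}\right)\right|^2 dx = 1,$$

which may be rewritten as

$$|A_n|^2\int_0^L \sin^2\left(\frac{n\pi x}{L}\right) dx = 1. \tag{3.20}$$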


Most of the integrals in this course can be evaluated by changing the variable of 
integration and using the list of standard integrals printed inside the back cover of 
the book. In the present case, we substitute $y = n\pi x/L$, from which it follows
that $x = (L/n\pi)y$ and so $dx = (L/n\pi)\,dy$. We must also transform the limits of
integration, noting that x = 0 corresponds to y = 0 and that x = L corresponds to
$y = n\pi$. Hence Equation 3.20 becomes

$$|A_n|^2\,\frac{L}{n\pi}\int_0^{n\pi}\sin^2 y\,dy = 1.$$

According to the table of integrals inside the back cover, the definite integral in
this expression has the value $n\pi/2$, so we get

$$|A_n|^2\,\frac{L}{n\pi}\,\frac{n\pi}{2} = 1.$$

It follows that $|A_n| = \sqrt{2/L}$. Remembering that each $A_n$ may be complex, this
means that

$$A_n = \sqrt{\frac{2}{L}}\,e^{i\delta},$$

where $\delta$ is a real constant. Phase factors, such as $e^{i\delta}$, which multiply entire
eigenfunctions and wave functions, never have any physical consequences, and
can be chosen arbitrarily (as discussed in Chapter 2). We shall make the simplest
choice by setting $\delta = 0$ so that $e^{i\delta} = 1$. Hence we can say that for each value of n

$$A_n = \sqrt{\frac{2}{L}}. \tag{3.21}$$


Substituting this value into Equation 3.15, and remembering that the 
eigenfunctions vanish outside the well, we see that the normalized energy 
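eigenfunction $\psi_n(x)$ corresponding to the eigenvalue $E_n$ is

$$\psi_n(x) = \sqrt{\frac{2}{L}}\sin\left(\frac{n\pi x}{L}\right) \quad \text{for } 0 \le x \le L, \tag{3.22}$$
$$\psi_n(x) = 0 \quad \text{elsewhere}. \tag{3.23}$$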
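

The normalization can be checked numerically. The following sketch integrates $|\psi_n(x)|^2$ over a range that extends beyond both walls, using a hand-written trapezoidal rule; the width L = 2 (in arbitrary units), the grid, and the helper names `psi` and `trapezoid` are assumptions chosen for illustration.

```python
import numpy as np

def psi(n, x, L):
    """Normalized eigenfunction of Equations 3.22 and 3.23."""
    inside = (x >= 0.0) & (x <= L)
    return np.where(inside, np.sqrt(2.0 / L) * np.sin(n * np.pi * x / L), 0.0)

def trapezoid(y, x):
    """Composite trapezoidal rule, written out to avoid NumPy version differences."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

L = 2.0                                # arbitrary width in arbitrary units
x = np.linspace(-1.0, 3.0, 40001)      # grid extends outside the well on both sides
norms = [trapezoid(np.abs(psi(n, x, L))**2, x) for n in range(1, 5)]
print(norms)
```

Each entry of `norms` should be very close to 1, whatever value is chosen for L, because the factor $\sqrt{2/L}$ compensates for the width of the well.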


Figure 3.8 shows the first four of these energy eigenfunctions $\psi_1$ to $\psi_4$, together
with the corresponding energy levels $E_1$ to $E_4$. Note that, as expected, the allowed
energy eigenfunctions are continuous but sharply kinked at the walls of the well
due to the infinite discontinuity in the potential energy function at these points.
Also note that the separation between neighbouring energy levels increases with
increasing n due to the factor of $n^2$ in the expression for $E_n$ (Equation 3.16).


Figure 3.8 Plots of the normalized energy eigenfunctions $\psi_1(x)$, $\psi_2(x)$, $\psi_3(x)$
and $\psi_4(x)$, together with the corresponding energy eigenvalues $E_1$, $E_2$, $E_3$ and
$E_4$, which are the first four energy levels in a one-dimensional infinite square
well. We are treating the eigenfunctions as real functions, though any of them
could be multiplied by a complex phase factor of the form $e^{i\delta}$ that would not
affect the calculation of any measurable quantity.


3.1.7 The stationary-state wave functions 


Having found the energy eigenvalues $E_n$, and energy eigenfunctions $\psi_n(x)$, we
can now write down the stationary-state wave functions. For any positive integer
n, the stationary-state wave function is

$$\Psi_n(x,t) = \sqrt{\frac{2}{L}}\sin\left(\frac{n\pi x}{L}\right)e^{-iE_n t/\hbar} \quad \text{for } 0 \le x \le L, \tag{3.24}$$
$$\Psi_n(x,t) = 0 \quad \text{elsewhere}. \tag{3.25}$$

The constant $\sqrt{2/L}$ ensures that the wave function is normalized. This wave
function provides a complete description of the state in which the particle has a
definite energy $E_n = n^2\pi^2\hbar^2/2mL^2$. Due to the nature of $\psi_n(x)$ (see Figure 3.8),
$\Psi_n(x,t)$ is always equal to zero at the ends of the box. This wave function is also
equal to zero at $(n - 1)$ points inside the box. These $(n - 1)$ points are called the
nodes of the wave function (the zeroes at the ends of the box are not counted as
nodes). Thus, the ground state wave function, with n = 1, has no nodes.
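

The node count can be confirmed with a short numerical sketch that counts the sign changes of $\sin(n\pi x/L)$ strictly inside the well. The function name `count_nodes`, the unit width and the grid size are illustrative assumptions.

```python
import numpy as np

def count_nodes(n, L=1.0, num=10001):
    """Count sign changes of sin(n pi x / L) at points strictly inside the well."""
    # Drop the two end points: the zeros at the walls are not counted as nodes.
    x = np.linspace(0.0, L, num)[1:-1]
    signs = np.sign(np.sin(n * np.pi * x / L))
    # A node lies between two neighbouring grid points of opposite sign.
    return int(np.sum(signs[1:] * signs[:-1] < 0))

for n in range(1, 7):
    print(n, count_nodes(n))
```

The grid must be fine compared with the spacing L/n between nodes, so `num` should be increased if much larger values of n are tried.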


Each stationary-state wave function, of the kind specified by Equation 3.24, 
describes a complex standing wave. A standing wave is one that oscillates 
without propagating through space. All points in the disturbance that constitutes a 
standing wave oscillate in phase with one another, with the same frequency but 
different amplitudes. The nodes of a standing wave remain fixed. 


To understand this in more detail, it is helpful to split the wave
function into its real and imaginary parts. This is done by noting that
$e^{-iE_n t/\hbar} = \cos(E_n t/\hbar) - i\sin(E_n t/\hbar)$, giving

$$\mathrm{Re}(\Psi_n(x,t)) = \sqrt{\frac{2}{L}}\sin\left(\frac{n\pi x}{L}\right)\cos\left(\frac{E_n t}{\hbar}\right) \quad \text{for } 0 \le x \le L, \tag{3.26}$$
$$\mathrm{Im}(\Psi_n(x,t)) = -\sqrt{\frac{2}{L}}\sin\left(\frac{n\pi x}{L}\right)\sin\left(\frac{E_n t}{\hbar}\right) \quad \text{for } 0 \le x \le L. \tag{3.27}$$

In each case the right-hand side represents a sinusoidal function of position,
$\sin(n\pi x/L)$, scaled at each point by a time-dependent factor. The time-dependent
factor is itself sinusoidal and oscillates with a period $T_n = 2\pi\hbar/E_n$. Various
‘snapshots’ of the real part of the wave function $\Psi_1(x,t)$, taken at different stages
during its period $T_1$, are shown in Figure 3.9. If you imagine these snapshots
forming an animated sequence you will get a good idea of what a standing wave is
like. Different stationary-state wave functions, with different values of n, have
different numbers of nodes and different periods. As n increases, so does the
energy of the state and the number of nodes, while the period of oscillation, $T_n$,
decreases.


Figure 3.9 emphasizes the time-dependence of the wave function. However, it is
equally important to appreciate that $\Psi_1(x,t)$ describes a stationary state in which
the probability density $|\Psi_1(x,t)|^2$ is independent of time. Remember that
the diagrams in Figure 3.9 only show the real part of the wave function. No
matter how the real part of the wave function $\Psi_1(x,t)$ changes at any particular
value of x, there will always be compensating changes in the imaginary part to
guarantee that $|\Psi_1(x,t)|^2$ is independent of time, and the same is true for any
stationary-state wave function, $\Psi_n(x,t)$.
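

The compensation between real and imaginary parts can be seen numerically. The sketch below evaluates the stationary-state wave function of Equation 3.24 at several times across one period and records how far the squared real part plus the squared imaginary part departs from its value at t = 0. Natural units with $\hbar = m = L = 1$ are an assumption made for convenience, and the helper names are hypothetical.

```python
import numpy as np

HBAR = M = L = 1.0   # natural units (illustrative assumption)

def energy(n):
    """E_n from Equation 3.16."""
    return n**2 * np.pi**2 * HBAR**2 / (2.0 * M * L**2)

def wave_function(n, x, t):
    """Psi_n(x, t) inside the well, Equation 3.24."""
    spatial = np.sqrt(2.0 / L) * np.sin(n * np.pi * x / L)
    return spatial * np.exp(-1j * energy(n) * t / HBAR)

n = 1
x = np.linspace(0.0, L, 501)
period = 2.0 * np.pi * HBAR / energy(n)          # T_n = 2 pi hbar / E_n
reference = np.abs(wave_function(n, x, 0.0))**2  # probability density at t = 0
deviations = []
for t in np.linspace(0.0, period, 9):
    psi_t = wave_function(n, x, t)
    density = psi_t.real**2 + psi_t.imag**2      # [Re]^2 + [Im]^2
    deviations.append(float(np.max(np.abs(density - reference))))
print(max(deviations))
```

The real part alone changes sign over half a period, yet the largest deviation in the density stays at the level of floating-point rounding.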


Figure 3.9 Snapshots of the
real part of the ground state
wave function $\Psi_1(x,t)$ from
$t_1 = 0$ to $t_5 = T_1/2$. This is a
standing wave with no nodes.


Exercise 3.6 Verify that the quantity $|\Psi_n(x,t)|^2$ is independent of time by
using its equivalence to $[\mathrm{Re}(\Psi_n(x,t))]^2 + [\mathrm{Im}(\Psi_n(x,t))]^2$.


Figure 3.10 gives plots of the probability density, $|\Psi_n(x,t)|^2$, for n = 1, 2, 3
and 4. The probability of finding the particle in a small region of length $\delta x$,
centred on x, is given by $|\Psi_n(x,t)|^2\,\delta x$. This remains independent of time.


It is worth giving special consideration to the points of zero probability density in
Figure 3.10. In classical physics, a particle with positive kinetic energy trapped in
a one-dimensional infinite square well would bounce back and forth between the
impenetrable walls. In quantum physics, it is entirely inappropriate to think of a
particle in a stationary state as bouncing back and forth at all. If you were to
forget this fact, you might wonder how a particle can move from one side of the
well to the other when there is a point of zero probability density in the way. How
can a particle that is on the left side of the well move to the right side of the well
without travelling through the point where the wave function vanishes and the
corresponding probability density is zero? Even asking such a question reveals a
failure to recognize just how radical quantum mechanics is.


In quantum mechanics, a particle has no position until it is measured, so it makes
no sense to speak of the particle as ‘moving’ from place to place. There is no
problem associated with the particle having to ‘travel through’ the nodal point,
since it does not have a velocity any more than it has a position. The particle’s
position or velocity might be measured in an appropriate experiment, but any
value obtained in this way only tells us about the properties of the particle
immediately after the measurement is taken. Before the measurement, the particle
is in a state described by a wave function $\Psi_n(x,t)$. This wave function gives the
most complete description that it is possible to have of the state of the system, but
it does not assign definite values to either the position or the velocity of the
particle.


Figure 3.10 Plots of the
probability density $|\Psi_n(x,t)|^2$
for n = 1, 2, 3 and 4. Note that
in each case there are points at
which the probability density is
zero. The points at which
$|\Psi_n(x,t)|^2 = 0$ correspond to
the ends of the box and the
nodes of $\Psi_n(x,t)$.


Exercise 3.7 As the above discussion indicates, a stationary-state wave
function such as $\Psi_n(x,t)$ does not in any sense allow us, moment by moment, to
‘track’ a moving particle. Consequently we cannot say that the coordinate x that
appears in the argument of $\Psi_n(x,t)$ represents the position of the particle at
time t. What, then, does x represent?


3.1.8 Alternative descriptions of the well 


Our description of a one-dimensional infinite square well of width L involved 
some special choices: 


1. We took the potential energy function to be zero inside the well. 
2. We placed the walls of the well at x = 0 and at x = L. 


Neither of these assumptions is essential. The potential energy inside the well 
could be given any constant value, since this would still correspond to zero 
force inside the well, and the walls of the well could be placed at $x = x_0$ and
$x = x_0 + L$, where $x_0$ is any coordinate along the x-axis. In this section we shall
look at the effects of changing our description of the well. This will change our
description of the energy eigenvalues, energy eigenfunctions and stationary-state
wave functions, but it will not change the physical behaviour of the particle in the
well, which cannot depend on our arbitrary choices. 


Changing the zero of potential energy 


In classical physics we have an arbitrary choice of where to place the zero of 
potential energy. Redefining the zero of potential energy has no influence on the 
behaviour of a system because only differences in potential energy are physically 
significant. The same is true in quantum mechanics. 


Figure 3.11 The potential energy function for a one-dimensional infinite
square well when the potential energy associated with the bottom of the well has
the value $V_0$.


Suppose that we take the potential energy at the bottom of the well to have the
constant value $V_0 > 0$ (Figure 3.11). What difference does this make to our
descriptions? In this case, the time-independent Schrödinger equation inside the
well takes the form

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V_0\psi(x) = E\psi(x) \quad \text{for } 0 \le x \le L,$$

which can be rearranged to give

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} = (E - V_0)\psi(x) \quad \text{for } 0 \le x \le L.$$

Comparing with Equation 3.7, we see that everything will be the same as before
except that the energy E in our previous description must now be replaced by
$E - V_0$. It follows that the energy levels of the well, $E_n$, obey the equation

$$E_n - V_0 = \frac{n^2\pi^2\hbar^2}{2mL^2} \quad \text{for } n = 1, 2, 3, \ldots$$


In other words, all the energy levels are raised by the same constant value, $V_0$. This
makes no difference to the observed behaviour of the particle. For example, the
frequency of light emitted when the particle makes a transition from one energy
level to another depends only on the difference between the energies involved, and
this is independent of $V_0$.
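

A short numerical sketch makes the point concrete. It computes the frequency associated with a transition between two levels, using the Planck relation $f = \Delta E/h$, with and without a constant offset at the bottom of the well. The electron mass, the 1 nm width, the offset of 5 eV and the function names are illustrative assumptions.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
H = 2.0 * math.pi * HBAR # Planck constant, J s
M_E = 9.1093837015e-31   # electron mass, kg (assumed particle)
EV = 1.602176634e-19     # joules per electronvolt

def energy_level(n, L, V0=0.0, m=M_E):
    """Level n of a well whose bottom sits at potential energy V0 (joules)."""
    return V0 + n**2 * math.pi**2 * HBAR**2 / (2.0 * m * L**2)

def transition_frequency(n_upper, n_lower, L, V0=0.0):
    """Frequency f = (E_upper - E_lower) / h for a transition between two levels."""
    return (energy_level(n_upper, L, V0) - energy_level(n_lower, L, V0)) / H

L = 1e-9                                    # assumed width of 1 nm
f_zero = transition_frequency(2, 1, L)      # bottom of the well at zero
f_shifted = transition_frequency(2, 1, L, V0=5.0 * EV)
print(f_zero, f_shifted)
```

The two printed frequencies agree to within rounding, although every individual level in the second case is higher by the assumed 5 eV.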


The stationary-state wave functions still vanish outside the well. Inside the well, 
we adapt Equation 3.24 by replacing $E_n$ by $E_n - V_0$ to obtain

$$\Psi_n(x,t) = \sqrt{\frac{2}{L}}\sin\left(\frac{n\pi x}{L}\right)e^{-i(E_n - V_0)t/\hbar} \quad \text{for } 0 \le x \le L.$$




Exercise 3.8 Why is it possible to say that these modified stationary-state
wave functions describe the same behaviour as the original stationary-state wave
function in Equation 3.24?


Since the location of the zero of energy makes no difference to the physical 
behaviour of a particle in an infinite well, we generally adopt the simplest choice, 
placing the zero of energy at the bottom of the well. However, the decision about 
where to place the walls of the well is less clear-cut. There is an alternative choice 
which has some advantages since it emphasizes the symmetry of the situation. 


Choosing a symmetric well 


Instead of placing the walls of the well at x = 0 and x = L, we can choose them
to be at $x = -L/2$ and $x = +L/2$, giving a potential energy function of the form

$$V(x) = 0 \quad \text{for } -L/2 \le x \le L/2,$$
$$V(x) = \infty \quad \text{elsewhere}.$$

Such a potential energy function is symmetrical either side of the origin, i.e.
$V(-x) = V(x)$, so we will refer to it as a symmetric well.


We could obviously set about solving the time-independent Schrödinger equation
for a symmetric infinite square well, following much the same procedure as used 
above for an unsymmetrical well. However, there is no need to go back to scratch 
in this way. Instead, we can take our previous solutions and adapt them by making 
a suitable change of variable. 


If we let $x' = x + L/2$, a symmetric well that extends from $x = -L/2$ to
$x = L/2$ will extend from $x' = 0$ to $x' = L$. Expressed in terms of the primed
coordinate, $x'$, this well is the same as that considered earlier, so we know that its
energy eigenfunctions take the form

$$\psi_n(x') = \sqrt{\frac{2}{L}}\sin\left(\frac{n\pi x'}{L}\right).$$

Returning to the unprimed coordinate, it follows that the energy eigenfunctions in
the symmetric well are

$$\psi_n(x) = \sqrt{\frac{2}{L}}\sin\left(\frac{n\pi(x + L/2)}{L}\right) = \sqrt{\frac{2}{L}}\sin\left(\frac{n\pi x}{L} + \frac{n\pi}{2}\right).$$

Expanding the sine of a sum gives

$$\sin\left(\frac{n\pi x}{L} + \frac{n\pi}{2}\right) = \sin\left(\frac{n\pi x}{L}\right)\cos\left(\frac{n\pi}{2}\right) + \cos\left(\frac{n\pi x}{L}\right)\sin\left(\frac{n\pi}{2}\right).$$

Further simplification is possible because n is an integer.

If n is even:
$$\sin\left(\frac{n\pi x}{L} + \frac{n\pi}{2}\right) = \sin\left(\frac{n\pi x}{L}\right)\cos\left(\frac{n\pi}{2}\right).$$

If n is odd:
$$\sin\left(\frac{n\pi x}{L} + \frac{n\pi}{2}\right) = \cos\left(\frac{n\pi x}{L}\right)\sin\left(\frac{n\pi}{2}\right).$$

The factors $\cos(n\pi/2)$ and $\sin(n\pi/2)$ that appear in these expressions are equal
to $+1$ or $-1$. This means that they are phase factors (that is, factors of unit
modulus). Such phase factors have no physical significance when they multiply
eigenfunctions, and we are always free to ignore them. We therefore conclude that 
the energy eigenfunctions inside a symmetric infinite square well of length L can 
be written as 


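$$\psi_n(x) = \sqrt{\frac{2}{L}}\sin\left(\frac{n\pi x}{L}\right) \quad \text{for even values of } n, \tag{3.28}$$
$$\psi_n(x) = \sqrt{\frac{2}{L}}\cos\left(\frac{n\pi x}{L}\right) \quad \text{for odd values of } n. \tag{3.29}$$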


The energy eigenvalues are the same as before and the stationary-state wave 
functions inside the well are 


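$$\Psi_n(x,t) = \sqrt{\frac{2}{L}}\sin\left(\frac{n\pi x}{L}\right)e^{-iE_n t/\hbar} \quad \text{for even values of } n, \tag{3.30}$$
$$\Psi_n(x,t) = \sqrt{\frac{2}{L}}\cos\left(\frac{n\pi x}{L}\right)e^{-iE_n t/\hbar} \quad \text{for odd values of } n. \tag{3.31}$$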


Outside the well, the eigenfunctions and wave functions are equal to zero. 
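

The change of variable can be checked numerically. The sketch below compares the original eigenfunction, evaluated at $x' = x + L/2$, with the sine and cosine forms of Equations 3.28 and 3.29, and reports whether they agree exactly or differ by an overall sign (a phase factor of $-1$). The unit width and the function names are illustrative assumptions.

```python
import numpy as np

def psi_shifted(n, x, L):
    """Eigenfunction of the well on [0, L], evaluated at x' = x + L/2."""
    return np.sqrt(2.0 / L) * np.sin(n * np.pi * (x + L / 2.0) / L)

def psi_symmetric(n, x, L):
    """Equations 3.28 and 3.29: sine for even n, cosine for odd n."""
    trig = np.sin if n % 2 == 0 else np.cos
    return np.sqrt(2.0 / L) * trig(n * np.pi * x / L)

L = 1.0
x = np.linspace(-L / 2.0, L / 2.0, 201)
results = []
for n in range(1, 7):
    a, b = psi_shifted(n, x, L), psi_symmetric(n, x, L)
    if np.allclose(a, b, atol=1e-12):
        results.append("same")
    elif np.allclose(a, -b, atol=1e-12):
        results.append("sign-flipped")   # differs only by the phase factor -1
    else:
        results.append("mismatch")
    print(n, results[-1])
```

No value of n should report a mismatch; the sign flips that do occur are exactly the physically irrelevant phase factors discussed above.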


Worked Example 3.1

Essential skill: Verifying that a given function satisfies the time-independent
Schrödinger equation together with appropriate boundary conditions

Show that the functions $\psi_n(x)$ in Equations 3.28 and 3.29 satisfy the
time-independent Schrödinger equation and find the corresponding energy
eigenvalues. State the continuity boundary conditions for a symmetric
infinite square well of width L and verify that these functions satisfy them.

Solution


Differentiating either of the functions in Equations 3.28 and 3.29 twice with
respect to x gives

$$\frac{d^2\psi_n}{dx^2} = -\left(\frac{n\pi}{L}\right)^2\psi_n(x).$$

Substituting either of the functions into the time-independent Schrödinger
equation then gives

$$-\frac{\hbar^2}{2m}\left(-\frac{n^2\pi^2}{L^2}\right)\psi_n(x) = E\psi_n(x),$$

which is satisfied provided that $E = n^2\pi^2\hbar^2/2mL^2$. This is the energy
eigenvalue corresponding to the energy eigenfunction $\psi_n(x)$.


For a symmetric infinite square well with walls at $x = -L/2$ and $x = L/2$,
the continuity boundary conditions are

$$\psi(-L/2) = \psi(L/2) = 0.$$

The eigenfunctions in Equations 3.28 and 3.29 satisfy these boundary
conditions because

$$\sin\left(-\frac{n\pi}{2}\right) = 0, \quad \sin\left(\frac{n\pi}{2}\right) = 0 \quad \text{for } n \text{ even},$$
$$\cos\left(-\frac{n\pi}{2}\right) = 0, \quad \cos\left(\frac{n\pi}{2}\right) = 0 \quad \text{for } n \text{ odd}.$$


The energy eigenfunctions of a symmetric infinite square well have some special
features that turn out to be very useful in calculations. These features relate to 
their evenness or oddness. 


Odd and even functions 
$f(x)$ is said to be an even function of x if $f(-x) = f(x)$ for all x.
$f(x)$ is said to be an odd function of x if $f(-x) = -f(x)$ for all x.


Most functions are neither even nor odd. However, cos x is an even function, and 
sin x is an odd function so, in a symmetric infinite square well, every energy 
eigenfunction is either even or odd. If n is odd, the eigenfunction $\psi_n(x)$ is an
even function; if n is even, the eigenfunction $\psi_n(x)$ is an odd function. In
particular, the ground state (with n = 1) is an even function, the first excited state 
(with n = 2) is an odd function, and so on. Moreover, the evenness and oddness 
of the eigenfunctions alternates as we climb upwards through the energy levels. 


It turns out (although we shall not prove it here) that all one-dimensional 
symmetric wells (i.e. wells with $V(-x) = V(x)$) share these properties: all their
energy eigenfunctions are either even or odd, the ground state is always even, and 
the evenness and oddness alternate as n increases. 
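

For the symmetric infinite square well, the alternation of evenness and oddness can be confirmed directly. The sketch below compares $\psi_n(-x)$ with $\psi_n(x)$ for the eigenfunctions of Equations 3.28 and 3.29; the function names and the unit width are illustrative assumptions, and the check says nothing about other symmetric wells.

```python
import numpy as np

def psi_symmetric(n, x, L=1.0):
    """Equations 3.28 and 3.29: sine for even n, cosine for odd n."""
    trig = np.sin if n % 2 == 0 else np.cos
    return np.sqrt(2.0 / L) * trig(n * np.pi * x / L)

def parity(n, L=1.0):
    """Return +1 if psi_n is an even function of x, -1 if it is odd."""
    x = np.linspace(0.0, L / 2.0, 101)
    plus, minus = psi_symmetric(n, x, L), psi_symmetric(n, -x, L)
    if np.allclose(minus, plus):
        return +1
    if np.allclose(minus, -plus):
        return -1
    raise ValueError("eigenfunction is neither even nor odd")

parities = [parity(n) for n in range(1, 7)]
print(parities)
```

The list starts with +1 for the ground state and then alternates in sign as n increases.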


3.2 Two- and three-dimensional infinite square wells 


In this section we generalize the results obtained for the one-dimensional infinite 
square well, first to two dimensions and then to three dimensions. The 
two-dimensional case can be thought of as a sort of stepping stone to quantum 
dots, which confine particles in all three dimensions. 


3.2.1 The two-dimensional infinite square well 


A two-dimensional infinite square well is one that restricts 
two of the coordinates of a particle. It is the kind of box 
that confines the electrons in a quantum wire in the dimensions 
perpendicular to the axis of the wire. For simplicity, we 
will assume that the box occupies a square region in two 
dimensions. Nevertheless, it is worth noting that the term 
‘square well’ refers to the abrupt rise of the potential energy
function, not to the region in which the potential energy
is zero; it would be perfectly possible to have a square
well that occupies a circular region in two dimensions,
although we will not discuss such a case here.


The potential energy function we shall investigate takes 
the form 


$$V(x,y) = 0 \quad \text{for } 0 \le x \le L \text{ and } 0 \le y \le L, \tag{3.32}$$
$$V(x,y) = \infty \quad \text{elsewhere}. \tag{3.33}$$


Figure 3.12 The potential energy function for
the two-dimensional infinite square well.


For convenience, we have aligned the walls of the well with the x- and
y-coordinate axes, chosen one corner of the well to be at the origin, and taken the 
potential energy to be zero inside the well. This potential energy function is 
illustrated in Figure 3.12. 


Starting from a classical system consisting of a particle of mass m, moving under 
the influence of the two-dimensional infinite square well V (x,y), the usual 
procedure leads to the Schrödinger equation 


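$$i\hbar\frac{\partial\Psi(x,y,t)}{\partial t} = -\frac{\hbar^2}{2m}\left(\frac{\partial^2\Psi}{\partial x^2} + \frac{\partial^2\Psi}{\partial y^2}\right) + V(x,y)\Psi(x,y,t). \tag{3.34}$$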


Because we are interested in finding the energy levels of the system, we will again 
look for stationary-state solutions. In this case we start by seeking solutions of the 
form 


$$\Psi(x,y,t) = \psi(x,y)T(t) \quad \text{for } 0 \le x \le L \text{ and } 0 \le y \le L, \tag{3.35}$$
$$\Psi(x,y,t) = 0 \quad \text{elsewhere}. \tag{3.36}$$


We shall now concentrate on the region inside the well. Using the potential 
energy function of Equation 3.32, and substituting the product wave function of 
Equation 3.35 into Schrödinger’s equation, the method of separation of variables 
leads to the following equations for the region inside the well: 


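$$i\hbar\,\frac{dT}{dt} = E\,T(t), \tag{3.37}$$

$$-\frac{\hbar^2}{2m}\left(\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2}\right) = E\,\psi(x, y). \tag{3.38}$$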


As always, Equation 3.37 is satisfied by a function of the form 

$$T(t) = e^{-iEt/\hbar},$$


while Equation 3.38 is the time-independent Schrödinger equation for this 
problem. This is the eigenvalue equation for energy; its solutions $\psi(x, y)$ are
the energy eigenfunctions and the corresponding values of E are the energy
eigenvalues.


Equation 3.38 can be further separated by looking for solutions of the form
$\psi(x, y) = X(x)Y(y)$. Substituting this expression into Equation 3.38 and
dividing both sides by X(x)Y(y) leads to the equation


$$-\frac{\hbar^2}{2m}\left[\frac{1}{X(x)}\frac{d^2X(x)}{dx^2} + \frac{1}{Y(y)}\frac{d^2Y(y)}{dy^2}\right] = E.$$


The left-hand side of this equation can be written as the sum of a function of x 
and a function of y, with the property that the value of this sum is constant. Given 
our freedom to choose the values of x and y independently, how can the sum of a 
function of x and a function of y remain constant? The only way is if the functions
are themselves constants (not necessarily the same). This allows us to write


$$-\frac{\hbar^2}{2m}\,\frac{1}{X(x)}\frac{d^2X(x)}{dx^2} = E_x, \tag{3.39}$$

$$-\frac{\hbar^2}{2m}\,\frac{1}{Y(y)}\frac{d^2Y(y)}{dy^2} = E_y, \tag{3.40}$$


where $E_x$ and $E_y$ are constants that satisfy $E_x + E_y = E$.


Now, Equations 3.39 and 3.40 have the same form as the one-dimensional 
time-independent Schrödinger equation that was solved in Section 3.1. Even the 
boundary conditions are the same, since X(x) must vanish at x = 0 and x = L 
and Y (y) must vanish at y = 0 and y = L. So, inside the well, 


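$$X(x) = A_{n_x}\sin\left(\frac{n_x \pi x}{L}\right) \quad \text{and} \quad Y(y) = A_{n_y}\sin\left(\frac{n_y \pi y}{L}\right), \tag{3.41}$$

where $A_{n_x}$ and $A_{n_y}$ are constants and $n_x$ and $n_y$ are positive integers with

$$\frac{n_x^2 \pi^2 \hbar^2}{2mL^2} = E_x \quad \text{and} \quad \frac{n_y^2 \pi^2 \hbar^2}{2mL^2} = E_y. \tag{3.42}$$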


The energy eigenfunction inside the well can therefore be written as 


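$$\psi_{n_x,n_y}(x, y) = X(x)Y(y) = A_{n_x,n_y}\sin\left(\frac{n_x \pi x}{L}\right)\sin\left(\frac{n_y \pi y}{L}\right),$$


where $A_{n_x,n_y} = A_{n_x}A_{n_y}$ is a constant that can be found by normalizing the
eigenfunction. In the present context, this implies that


$$\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} |\psi_{n_x,n_y}(x, y)|^2 \, dx \, dy = 1,$$


that is,


$$|A_{n_x,n_y}|^2 \int_0^L \sin^2\left(\frac{n_x \pi x}{L}\right) dx \int_0^L \sin^2\left(\frac{n_y \pi y}{L}\right) dy = 1. \tag{3.43}$$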


Each of the definite integrals on the left-hand side of Equation 3.43 is of a form 
already met in Section 3.1.4, and is equal to L/2. Taking the normalization
constant to be real and positive, we conclude that the normalized energy 
eigenfunctions are 


$$\psi_{n_x,n_y}(x, y) = \frac{2}{L}\sin\left(\frac{n_x \pi x}{L}\right)\sin\left(\frac{n_y \pi y}{L}\right), \tag{3.44}$$


inside the well, with $\psi_{n_x,n_y}(x, y) = 0$ outside. Using $E = E_x + E_y$, together
with Equation 3.42, the corresponding energy eigenvalues are


$$E_{n_x,n_y} = \frac{(n_x^2 + n_y^2)\,\pi^2 \hbar^2}{2mL^2}. \tag{3.45}$$


Note that in two dimensions, the energy eigenfunctions, energy eigenvalues and
stationary-state wave functions are labelled by two quantum numbers, $n_x$ and $n_y$.


Finally, putting everything together, the normalized stationary-state wave 
functions are 


$$\Psi_{n_x,n_y}(x, y, t) = \frac{2}{L}\sin\left(\frac{n_x \pi x}{L}\right)\sin\left(\frac{n_y \pi y}{L}\right) e^{-iE_{n_x,n_y}t/\hbar}, \tag{3.46}$$


inside the well, with $\Psi_{n_x,n_y}(x, y, t) = 0$ outside.

Figure 3.13 provides snapshots of the real part of one of these wave functions, for
the case $n_x = 1$ and $n_y = 2$. The snapshots are taken at five instants $t_1$ to $t_5$,
starting with $t_1$. Note that the real part of the wave function again takes the form
of a standing wave, though this time it is two-dimensional, like the standing wave
that might be observed on the head of a rectangular drum or some other stretched
membrane with fixed edges. Notice also that $\Psi_{1,2}(x, y, t)$ contains a nodal line
at y = L/2, along which the wave function is permanently zero. Such lines
correspond to the nodal points (nodes) that were present in the one-dimensional
case.


The probability of finding the particle in a small rectangular region of area
δx × δy centred on the point (x, y) is $|\Psi_{1,2}(x, y, t)|^2\,\delta x\,\delta y$. The corresponding
probability density $|\Psi_{1,2}(x, y, t)|^2$ is independent of time, since the
time-dependent phase factor has unit modulus, just as in the one-dimensional
case. This is what you would expect for a stationary state.
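
As a rough numerical check of Equations 3.44 to 3.46, the following Python sketch estimates the normalization integral with a midpoint rule and compares the probability density at two different times. The function names, the choice L = 1, the grid size, and the units (ħ = 1, with energy measured in units of π²ħ²/2mL²) are assumptions made purely for illustration.

```python
import cmath
import math

L = 1.0  # assumed well width, in arbitrary length units


def energy(nx, ny):
    """Equation 3.45, in units of pi^2 hbar^2 / (2 m L^2)."""
    return nx**2 + ny**2


def wave_function(nx, ny, x, y, t):
    """Equation 3.46, with hbar = 1 and energy in the units used above."""
    if not (0.0 < x < L and 0.0 < y < L):
        return 0.0  # the wave function vanishes outside the well
    spatial = (2.0 / L) * math.sin(nx * math.pi * x / L) * math.sin(ny * math.pi * y / L)
    return spatial * cmath.exp(-1j * energy(nx, ny) * t)


def total_probability(nx, ny, t, steps=200):
    """Midpoint-rule estimate of the integral of |Psi|^2 over the well."""
    h = L / steps
    total = 0.0
    for i in range(steps):
        for j in range(steps):
            psi = wave_function(nx, ny, (i + 0.5) * h, (j + 0.5) * h, t)
            total += abs(psi) ** 2
    return total * h * h


# The state with nx = 1, ny = 2 stays normalized at all times.
print(total_probability(1, 2, 0.0))
print(total_probability(1, 2, 0.7))

# The probability density at a fixed point does not change with time.
print(abs(wave_function(1, 2, 0.3, 0.2, 0.0)) ** 2)
print(abs(wave_function(1, 2, 0.3, 0.2, 1.9)) ** 2)
```

Both integrals should come out very close to 1, and the two densities should agree, because the time dependence enters only through a phase factor of unit modulus.
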


Exercise 3.9 Show that for a stationary state described by the wave function
$\Psi_{n_x,n_y}(x, y, t)$, the corresponding probability density is independent of
time.


3.2.2 Degeneracy 


The first few energy eigenvalues of a two-dimensional infinite square well are
shown in the energy-level diagram of Figure 3.14. Here, each state is
specified by an ordered pair of integer quantum numbers $(n_x, n_y)$, and the
corresponding energy level is indicated by the position of a horizontal line on a
vertical scale. Each state is distinct; for example, the wave function $\Psi_{1,2}(x, y, t)$ is
not the same function as $\Psi_{2,1}(x, y, t)$. Yet some of the states have the same
energy. This is the case for the states described by $\Psi_{1,2}(x, y, t)$ and $\Psi_{2,1}(x, y, t)$
for example, and for the states corresponding to $\Psi_{1,3}(x, y, t)$ and $\Psi_{3,1}(x, y, t)$.


The occurrence of different quantum states (i.e. states described by different wave 
functions) with the same energy is referred to as degeneracy. An energy level that 
corresponds to two or more quantum states is said to be degenerate. Moreover, 
states that share the same energy are said to be degenerate with one another. 


The number of states that correspond to an energy level is called the degree of 
degeneracy of that level. As Figure 3.14 shows, several of the energy levels are 
doubly degenerate or two-fold degenerate; that is to say there are two different 
states that correspond to that particular energy eigenvalue. This degree of 
degeneracy is not altogether surprising when the two-dimensional well occupies a 
square region of space. It is a consequence of the square’s symmetry, and could be 
removed by distorting the square region of space into a rectangular one. However, 
other degeneracies are also possible. For instance, in the case of the square
well occupying a square region of space, the level of energy $50\pi^2\hbar^2/2mL^2$ is
three-fold degenerate since 50 may be written as $7^2 + 1^2$, $1^2 + 7^2$ and $5^2 + 5^2$.


It is interesting to note that degeneracy is a phenomenon that first reveals itself in 
two dimensions. There are no degeneracies in one-dimensional wells. 
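
The counting behind these statements is easy to automate. The short Python sketch below groups the states (n_x, n_y) by the value of n_x² + n_y², which is the energy of Equation 3.45 in units of π²ħ²/2mL². The function name `degeneracies` and the cut-off `n_max` are illustrative choices, not part of the course material.

```python
from collections import defaultdict


def degeneracies(n_max):
    """Map nx^2 + ny^2 to the list of states (nx, ny) with 1 <= nx, ny <= n_max.

    The lists are complete only for energies up to n_max^2 + 1, because
    higher levels may involve quantum numbers beyond the cut-off.
    """
    levels = defaultdict(list)
    for nx in range(1, n_max + 1):
        for ny in range(1, n_max + 1):
            levels[nx * nx + ny * ny].append((nx, ny))
    return levels


levels = degeneracies(10)
print(levels[2])   # [(1, 1)]: the ground state is not degenerate
print(levels[5])   # [(1, 2), (2, 1)]: doubly degenerate
print(levels[50])  # [(1, 7), (5, 5), (7, 1)]: three-fold degenerate
```
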


Exercise 3.10 Which other states of the particle in a two-dimensional infinite
square well are degenerate with the $n_x = 7$, $n_y = 4$ state? What is the degree of
degeneracy of the corresponding energy level?


Figure 3.13 Snapshots of the real part of $\Psi_{1,2}(x, y, t)$ at five
instants, starting with $t_1$.


Figure 3.14 Part of an energy-level diagram for a particle in a two-dimensional
infinite square well. Each level is labelled by its states $(n_x, n_y)$, and the
vertical axis shows energy marked in multiples of $\pi^2\hbar^2/2mL^2$.


3.2.3 The three-dimensional infinite square well


A three-dimensional infinite square well is one that traps the particle in all three
dimensions. It is the kind of box that confines the electrons in a quantum dot.
Given our discussion of one- and two-dimensional infinite square wells, the
quantum mechanics of a particle of mass m in a three-dimensional infinite square
well should hold few surprises, so we proceed by stating the conclusions.


For simplicity, we assume that the box occupies a cubic region, with sides of
length L. Then the states are described by three quantum numbers $n_x$, $n_y$ and $n_z$,
each of which may take any of the values 1, 2, 3, 4, .... The energy level
corresponding to any specific choice of the positive integers $n_x$, $n_y$ and $n_z$ is


$$E_{n_x,n_y,n_z} = \frac{(n_x^2 + n_y^2 + n_z^2)\,\pi^2 \hbar^2}{2mL^2}. \tag{3.47}$$


Inside the well, the normalized stationary-state wave function describing the state
characterized by $n_x$, $n_y$ and $n_z$ is


$$\Psi_{n_x,n_y,n_z}(x, y, z, t) = \psi_{n_x,n_y,n_z}(x, y, z)\, e^{-iE_{n_x,n_y,n_z}t/\hbar}, \tag{3.48}$$


where $\psi_{n_x,n_y,n_z}(x, y, z)$ is a normalized energy eigenfunction that satisfies
the time-independent Schrödinger equation for the problem. The form of that
equation, and of $\psi_{n_x,n_y,n_z}(x, y, z)$, depends on the precise location of the square
well. If the well occupies the region in which 0 < x < L and 0 < y < L and
0 < z < L then the normalized energy eigenfunction inside the well is


$$\psi_{n_x,n_y,n_z}(x, y, z) = \left(\frac{2}{L}\right)^{3/2}\sin\left(\frac{n_x \pi x}{L}\right)\sin\left(\frac{n_y \pi y}{L}\right)\sin\left(\frac{n_z \pi z}{L}\right). \tag{3.49}$$


Everywhere outside the well, the eigenfunction is equal to zero.


In three dimensions, the energy eigenfunctions, energy eigenvalues and
stationary-state wave functions are labelled by three quantum numbers
$n_x$, $n_y$ and $n_z$.


If any of these results (including the normalization constant $(2/L)^{3/2}$) comes as a
surprise, you should review the treatment of the two-dimensional infinite square 
well, or, if necessary, work through the three-dimensional problem in detail for 
yourself. 
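
As a brief check of the normalization constant, suppose the eigenfunction inside the well is written as a constant A (a symbol introduced here for illustration) multiplied by the three sine factors of Equation 3.49. The normalization integral then factorizes in the same way as Equation 3.43, and each factor is again equal to L/2:

$$|A|^2 \int_0^L \sin^2\left(\frac{n_x \pi x}{L}\right) dx \int_0^L \sin^2\left(\frac{n_y \pi y}{L}\right) dy \int_0^L \sin^2\left(\frac{n_z \pi z}{L}\right) dz = |A|^2 \left(\frac{L}{2}\right)^3 = 1.$$

Taking A to be real and positive gives $A = (2/L)^{3/2}$, in agreement with Equation 3.49.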


Exercise 3.11 (a) Obtain an expression for the difference in energy between 
the two lowest energy levels of a particle of mass m confined to a cube with sides 
of length L by a three-dimensional infinite square well potential energy function. 


(b) What is the degree of degeneracy of each of these two energy levels? 


Exercise 3.12 What are acceptable SI units for the stationary-state wave
functions in infinite square wells in one dimension, two dimensions and three
dimensions? Explain why your answers are consistent with Born’s rule.


3.2.4 F-centres and quantum dots 


The quantum-mechanical system of a particle trapped in a three-dimensional 
infinite square well provides a reasonable model for some real physical systems.
We shall briefly consider two of these systems, F-centres and quantum dots,
including experimental results that support some of our theoretical analysis. 


First consider a crystal — that is, a regular arrangement of atoms. A simple 
example is common salt, sodium chloride (NaCl), in which the chlorine atoms 
(Cl) form the arrangement shown in Figure 3.15, while equal numbers of smaller 
sodium atoms (Na) fill the spaces between. In fact, each sodium atom loses one 
electron, and each chlorine atom gains one, so it is more accurate to say that the 
crystal is composed of ions (denoted by Na⁺ and Cl⁻). The electrical attraction
between oppositely-charged ions is responsible for keeping the sodium chloride 
crystal bound together: it is said to be an ionic crystal. 


Figure 3.15 The arrangement of sodium (+) and chlorine (−) ions in crystalline salt (NaCl).

Figure 3.16 An F-centre created by the ejection of a negative ion and its replacement by
an electron. The shading schematically indicates the probability density for the
electron in one particular state of the F-centre.


A perfect crystal of sodium chloride would be transparent, but perfect crystals are
rare. Table salt looks white because each grain contains many defects in its
otherwise regular arrangement of ions. One special type of defect, called an
F-centre, is especially interesting for us. It is possible for a negative chlorine ion
to be ejected from the crystal and have its place taken by an electron, as indicated
in Figure 3.16. The electron, whose closest neighbours are positive ions (Na⁺),
behaves as a particle trapped in a three-dimensional well. The exact shape of the
well depends on the details of the crystal, but the crucial point is that the electron
has its own characteristic set of energy levels, and transitions between these
energy levels allow the F-centre to absorb electromagnetic radiation. In the case
of NaCl, the absorbed radiation is visible light, in the violet part of the spectrum.
Consequently, a sufficient concentration of F-centres, produced by irradiating an
NaCl crystal with gamma rays for example, has the effect of turning the salt
crystals an orange-brown colour, since that is the complement of violet (i.e. the
result of subtracting violet from white light).

The term ‘F-centre’ comes from the German word ‘Farbe’, meaning colour.


We can roughly model an F-centre by treating it as a cubical three-dimensional 
infinite square well containing a trapped particle. The length L of a side of the
cube is taken to be proportional to the separation of two similar ions in the crystal
(the so-called lattice spacing). In sodium chloride and similar crystals, any visible
light absorbed by an F-centre corresponds to transitions between neighbouring
energy levels. One obvious prediction, emerging from Equation 3.47, is that the
energy difference between neighbouring levels is proportional to $1/L^2$. So, if we
take a range of different ionic crystals, all with F-centres, we might expect the 
characteristic frequency of absorption to be proportional to the inverse square of 
the lattice spacing. Figure 3.17 shows that this prediction is quite accurate. There 
are some minor discrepancies, but this is only to be expected bearing in mind that 
an electron in an F-centre is in a well of finite depth, and experiences a force when 
it is inside the well. 
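
The inverse-square scaling can be illustrated with a short Python sketch based on Equation 3.47. It is only a sketch under stated assumptions: the trapped particle is taken to be an electron, the transition is taken (as an example) to be from the state (1, 1, 1) to the state (2, 1, 1), and the two cube sizes are hypothetical values, not measured lattice spacings.

```python
import math

HBAR = 1.054571817e-34        # reduced Planck constant in J s
PLANCK = 2.0 * math.pi * HBAR  # Planck constant in J s
M_E = 9.1093837015e-31         # electron mass in kg


def level_energy(n, side, mass=M_E):
    """Equation 3.47 for quantum numbers n = (nx, ny, nz), in joules."""
    nx, ny, nz = n
    return (nx**2 + ny**2 + nz**2) * math.pi**2 * HBAR**2 / (2.0 * mass * side**2)


def absorption_frequency(lower, upper, side, mass=M_E):
    """Frequency (in Hz) of light absorbed in the transition lower -> upper."""
    return (level_energy(upper, side, mass) - level_energy(lower, side, mass)) / PLANCK


# Hypothetical cube sizes: doubling the side should quarter the frequency.
f_small = absorption_frequency((1, 1, 1), (2, 1, 1), 0.5e-9)
f_large = absorption_frequency((1, 1, 1), (2, 1, 1), 1.0e-9)
print(f_small / f_large)
```

The ratio printed is 4 (to within rounding), whatever pair of levels is chosen, because every energy in Equation 3.47 carries the same factor of $1/L^2$.
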


Figure 3.17 A plot of the frequency of absorbed light (vertical axis) against
lattice spacing in nm (horizontal axis) for a range of ionic crystals containing
F-centres. These data are broadly consistent with the relationship:
frequency ∝ 1/(lattice spacing)².


A similar pattern is observed in quantum dots. The variation of colour shown in 
Figure 3.2 arises because small quantum dots have energy levels that are far apart, 
leading to high frequencies of absorbed light, while larger quantum dots have 
energy levels that are closer together, leading to smaller frequencies of absorbed 
light. Quantum dots are more complicated than F-centres because the trapped 
electrons are now in a semiconducting medium, rather than in empty space. This 
complicates the picture, as you will see later in the course, but in many cases the 
frequency of emitted light is a linear function of $1/L^2$, which can be regarded as
being a consequence of Equation 3.47. 


3.3 The finite square well 


The wells considered so far have all had infinite depth, ensuring that the particle 
remains completely trapped inside their walls. In this final section, we consider a 
one-dimensional well of finite depth. You will see that the particle then stands 
some chance of being found outside the well. 


As before, we start by giving a classical description of the system; we then write 
down Schrödinger’s equation, solve it for stationary states subject to appropriate
boundary and normalization conditions, and interpret the resulting wave functions.


3.3.1 The system and its Schrödinger equation


Figure 3.18 shows the potential energy function of a one-dimensional finite 
square well of width L and depth $V_0$. In this case we have chosen to consider a
symmetric well with boundaries at x = ±L/2. In another change of convention
we have chosen to associate zero potential energy with the region outside the well. 
This is a reasonable convention in the present case, because it corresponds to 
saying that the particle has zero potential energy when it is far from the influence 
of the well (and therefore free). 


According to classical physics, a particle trapped in the well will have a negative
potential energy $-V_0$, a positive kinetic energy in the range $0 < E_{\text{kin}} < V_0$, and a
negative total energy in the range $-V_0 < E < 0$. Energy considerations then
imply that the regions outside the box, where the particle’s total energy would 
have to be positive, are classically forbidden to the negative-energy particle bound 
inside the well. 


Figure 3.18 The potential energy function of a symmetric one-dimensional 
finite square well of width L and depth $V_0$.


The potential energy function may be written as 


$$V(x) = -V_0 \quad \text{for } -L/2 < x < L/2, \tag{3.50}$$

$$V(x) = 0 \quad \text{elsewhere}. \tag{3.51}$$


Apart from the potential energy function, the only other ingredient of our classical 
system is the particle itself which is characterized by a mass m. So the parameters 
describing the system are m, $V_0$ and L, which are all positive quantities.


The quantum-mechanical description of the system is based on Schrödinger’s
equation


$$i\hbar\,\frac{\partial \Psi(x, t)}{\partial t} = -\frac{\hbar^2}{2m}\frac{\partial^2 \Psi(x, t)}{\partial x^2} + V(x)\,\Psi(x, t). \tag{3.52}$$


We are looking for stationary-state solutions of the form $\Psi(x, t) = \psi(x)\,e^{-iEt/\hbar}$,
where E < 0 and the function $\psi(x)$ satisfies the time-independent Schrödinger
equation,


$$-\frac{\hbar^2}{2m}\frac{d^2\psi(x)}{dx^2} + V(x)\,\psi(x) = E\,\psi(x). \tag{3.53}$$


Substituting the finite square well potential energy function (Equations 3.50 and 
3.51) into this equation gives 


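$$-\frac{\hbar^2}{2m}\frac{d^2\psi(x)}{dx^2} - V_0\,\psi(x) = E\,\psi(x) \quad \text{for } -L/2 < x < L/2, \tag{3.54}$$

$$-\frac{\hbar^2}{2m}\frac{d^2\psi(x)}{dx^2} = E\,\psi(x) \quad \text{elsewhere}. \tag{3.55}$$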


We now have two differential equations to solve, one inside the well and the other 
outside. The allowed solutions are subject to boundary conditions. Firstly, we 
must ensure that $\psi(x)$ does not diverge as x → ±∞. Secondly, we must make
sure that the solutions to Equation 3.54 join on smoothly to the solutions to
Equation 3.55. Although the potential energy function has abrupt steps, it does
not become infinite, so the continuity boundary conditions require that both the
eigenfunction $\psi(x)$ and its derivative $d\psi/dx$ are continuous. In particular,


$$\psi(x) \text{ is continuous at } x = \pm L/2, \tag{3.56}$$

$$\frac{d\psi}{dx} \text{ is continuous at } x = \pm L/2. \tag{3.57}$$


In the next section we will solve Equations 3.54 and 3.55 subject to these 
boundary conditions. 


3.3.2 Solving the time-independent Schrödinger equation


The region outside the well


Let us start by investigating the part of the eigenfunction outside the well, which 
must satisfy Equation 3.55. At first sight this seems to be similar to the problem 
we solved inside the infinite square well, so we might expect to simply modify the 
solution given in Equations 3.10 and 3.11. This approach would lead to a solution 
of the form 


$$\psi(x) = A\sin(kx) + B\cos(kx) \quad \text{for } x < -L/2 \text{ or } x > L/2,$$


where A and B are arbitrary constants and $k = \sqrt{2mE}/\hbar$. However, in the
present case the energy eigenvalues E are negative, so k is imaginary. Under these
circumstances it is better to take a different approach: to consider each part of the
outside region separately and write the general solution to Equation 3.55 in the
form


$$\psi(x) = Ae^{\alpha x} + Be^{-\alpha x} \quad \text{for } x < -L/2, \tag{3.58}$$

$$\psi(x) = Fe^{\alpha x} + Ge^{-\alpha x} \quad \text{for } x > L/2, \tag{3.59}$$

where A, B, F and G are arbitrary constants, and α is the positive constant


$$\alpha = \frac{\sqrt{2m(-E)}}{\hbar}. \tag{3.60}$$

Remember, we are considering the case E < 0.


Exercise 3.13 Show that $\psi(x) = Ae^{\alpha x} + Be^{-\alpha x}$ is the general solution to the
time-independent Schrödinger equation for the one-dimensional finite square well
in the region x < −L/2, provided that $\alpha = \sqrt{2m(-E)}/\hbar$.

Section 8.2 explains what is meant by the general solution of a differential equation.


One reason for writing the solutions outside the well in the form of Equations 3.58
and 3.59 is that we can easily identify how these functions behave far from
the well. In Equation 3.58 (where x is negative) the term $Be^{-\alpha x}$ diverges as
x → −∞, so we conclude that B = 0. Similarly, in Equation 3.59 (where x
is positive) the term $Fe^{\alpha x}$ diverges as x → ∞, so we conclude that F = 0.
Imposing these conditions we can now say that outside the well


$$\psi(x) = Ae^{\alpha x} \quad \text{for } x < -L/2, \tag{3.61}$$

$$\psi(x) = Ge^{-\alpha x} \quad \text{for } x > L/2. \tag{3.62}$$


In addition, we are dealing with a symmetric well. In any symmetric well, the 
eigenfunctions are always either even or odd functions, so we conclude that: 


for even eigenfunctions A = G,


for odd eigenfunctions A = −G.


When dealing with one-dimensional wells, it is always possible to choose the 
energy eigenfunctions to be real functions. This allows us to draw meaningful 
graphs of $\psi(x)$ for the one-dimensional finite square well, just as we did in the
earlier case of the infinite square well. Figure 3.19 shows the relevant graphs for 
the parts of the eigenfunction that lie outside the well in cases of even 
eigenfunctions and odd eigenfunctions, as described by Equations 3.61 and 3.62. 


Figure 3.19 The behaviour of: (a) an even eigenfunction and (b) an odd
eigenfunction outside a one-dimensional finite square well. In each case, the
eigenfunction decreases exponentially with distance from the walls of the well.
For the purposes of graph plotting, A has been taken to be positive.


Remember cos(−x) = cos x.


These diagrams assume that A and G are non-zero and you will see shortly that
this is always the case. They reveal that the energy eigenfunction for a state with 
negative energy extends outside the well. This implies that, in quantum physics, 
there is a possibility of detecting the particle in the classically forbidden region. 
This purely quantum phenomenon, often referred to as barrier penetration, has 
no classical analogue. It is another indication of the vast gulf that separates 
classical from quantum physics. 


Outside the well, the energy eigenfunctions approach zero exponentially with 
increasing distance from the walls of the well. Whatever the value of the 
eigenfunction at the edge of the well, its value will have fallen to a fraction
$1/e \approx 0.368$ of that value at a distance $1/\alpha = \hbar/\sqrt{2m(-E)}$ beyond the wall. This implies that
the eigenfunctions corresponding to the lowest energy eigenvalues (i.e. those with
the largest values of the positive quantity $-E$) will decrease most rapidly with
distance from the walls of the well. 
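
To get a feel for the length scale 1/α, the Python sketch below evaluates $\hbar/\sqrt{2m(-E)}$ for an electron. The function name `decay_length` and the sample energies (−0.1 eV, −1 eV and −10 eV) are illustrative assumptions rather than values taken from the text.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant in J s
M_E = 9.1093837015e-31   # electron mass in kg
EV = 1.602176634e-19     # one electronvolt in joules


def decay_length(energy, mass=M_E):
    """Return 1/alpha = hbar / sqrt(2 m (-E)) in metres for a bound-state energy E < 0 (in joules)."""
    if energy >= 0.0:
        raise ValueError("a bound state requires a negative energy")
    return HBAR / math.sqrt(2.0 * mass * (-energy))


for e_in_ev in (-0.1, -1.0, -10.0):
    length_nm = decay_length(e_in_ev * EV) * 1e9
    print(f"E = {e_in_ev:6.1f} eV   1/alpha = {length_nm:.3f} nm")
```

For an electron with E = −1 eV the decay length is roughly 0.2 nm, and it shrinks as the state becomes more tightly bound, in line with the statement above.
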


The region inside the well 


To find the energy eigenfunctions inside the well, we must solve Equation 3.54. In 
this case we can expect the solutions to be similar to those we met in the case of 
the symmetric infinite square well, so we shall immediately state that: 


$$\text{for even eigenfunctions} \quad \psi(x) = D\cos(kx), \tag{3.63}$$

$$\text{for odd eigenfunctions} \quad \psi(x) = C\sin(kx), \tag{3.64}$$


where C and D are arbitrary constants and x lies in the range from −L/2 to L/2.


Substituting Equation 3.63 or 3.64 into Equation 3.54 shows that the positive
quantity k must be related to the energy eigenvalue E and the well depth $V_0$ as
follows:


$$k = \frac{\sqrt{2m(E + V_0)}}{\hbar}. \tag{3.65}$$


Although E is negative, the quantity $E + V_0$ is positive; it is the energy above the
bottom of the well, and may be thought of as the analogue of the classical kinetic 
energy. 


Figure 3.20 shows two of these eigenfunctions in the region inside the well. There 
is no reason to suppose that the eigenfunctions vanish at the edges of the well; 
what matters is that they should join on smoothly to the eigenfunctions outside the 
well plotted in Figure 3.19. 


Matching solutions at boundaries 


Following our experience with the infinite square well, we can expect the allowed 
energy eigenfunctions and eigenvalues to arise from the continuity boundary 
conditions imposed at the edges of the well. The requirements that $\psi(x)$ and
$d\psi/dx$ should each be continuous at x = −L/2 lead to the following conditions
for the even eigenfunctions: 


$$Ae^{-\alpha L/2} = D\cos(kL/2), \tag{3.66}$$

$$\alpha Ae^{-\alpha L/2} = kD\sin(kL/2), \tag{3.67}$$

and to the following conditions for the odd eigenfunctions:


$$Ae^{-\alpha L/2} = -C\sin(kL/2), \tag{3.68}$$

$$\alpha Ae^{-\alpha L/2} = kC\cos(kL/2). \tag{3.69}$$

Remember sin(−x) = −sin x.


Figure 3.20 The behaviour of: (a) an even eigenfunction and (b) an odd
eigenfunction inside a one-dimensional finite square well. In each case, the
eigenfunction has an appreciable non-zero value at the walls.


Dividing each side of Equation 3.67 by the corresponding side of Equation 3.66, 
and rearranging, gives 


$$\text{for even eigenfunctions:} \quad k\tan(kL/2) = \alpha. \tag{3.70}$$


Similarly, dividing each side of Equation 3.69 by the corresponding side of
Equation 3.68, and rearranging, gives


$$\text{for odd eigenfunctions:} \quad k\cot(kL/2) = -\alpha. \tag{3.71}$$


Recalling that $\alpha = \sqrt{2m(-E)}/\hbar$ and $k = \sqrt{2m(E + V_0)}/\hbar$, we see that
Equations 3.70 and 3.71 implicitly determine the energy eigenvalues, E, in terms
of the parameters m, $V_0$ and L that characterize the system. There is no way
of solving these equations algebraically to get an explicit expression for E.
However, it is relatively easy to solve them numerically, using a computer. When
this is done, a finite number of negative energy eigenvalues is obtained: $E_1$,
$E_2$, .... For a one-dimensional finite square well, there is always at least one
eigenvalue, but the precise number depends on the width and depth of the well
and the mass of the particle.
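
One possible numerical approach is sketched below in Python; it is not the method used by the course software. It introduces the dimensionless variables ξ = kL/2 and η = αL/2 (names chosen here), which satisfy ξ² + η² = R² with R² = mV₀L²/2ħ². Multiplying Equation 3.70 by cos ξ and Equation 3.71 by sin ξ gives the equivalent conditions ξ sin ξ − η cos ξ = 0 (even) and ξ cos ξ + η sin ξ = 0 (odd), which stay finite everywhere. Each interval jπ/2 < ξ < (j + 1)π/2 lying below R contains one root, located here by bisection, and the energy follows from E/V₀ = (ξ/R)² − 1.

```python
import math


def finite_well_levels(R):
    """Return E/V0 for every bound state of a finite square well.

    R = sqrt(m V0 L^2 / (2 hbar^2)) is a dimensionless measure of the
    strength of the well (a name and symbol assumed for this sketch).
    """
    levels = []
    j = 0
    while j * math.pi / 2.0 < R:
        even = (j % 2 == 0)

        def g(xi):
            eta = math.sqrt(max(R * R - xi * xi, 0.0))
            if even:
                return xi * math.sin(xi) - eta * math.cos(xi)  # from Equation 3.70
            return xi * math.cos(xi) + eta * math.sin(xi)      # from Equation 3.71

        a = j * math.pi / 2.0
        b = min((j + 1) * math.pi / 2.0, R)
        for _ in range(200):  # bisection: g changes sign between a and b
            mid = 0.5 * (a + b)
            if g(a) * g(mid) <= 0.0:
                b = mid
            else:
                a = mid
        xi = 0.5 * (a + b)
        levels.append((xi / R) ** 2 - 1.0)
        j += 1
    return levels


for R in (0.5, 1.0, 4.0):
    print(R, [round(e, 4) for e in finite_well_levels(R)])
```

However small R is made, the loop always runs at least once, which reflects the statement that a one-dimensional finite square well always has at least one bound state. With R = 4.0 the sketch finds three levels, alternating even, odd, even.
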


For each energy eigenvalue, $E_n$, a corresponding eigenfunction $\psi_n(x)$ can be
found from Equations 3.61, 3.62, 3.63 and 3.64. Given the value of $E_n$, we can
determine α and k and go back to Equations 3.66 and 3.68 to find the ratios D/A
and C/A. Finally, we impose the normalization condition by requiring that


$$\int_{-\infty}^{\infty} |\psi_n(x)|^2 \, dx = 1. \tag{3.72}$$

Note that the integral runs from −∞ to ∞ rather than from −L/2 to L/2. This is
because we can no longer presume that a particle with negative energy is restricted
to the well, even though it would be so restricted in classical physics. The
normalization condition (Equation 3.72) leads to a value for the constant A and
hence to an explicit expression for the normalized energy eigenfunction, $\psi_n(x)$.
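
For an even eigenfunction, for example, the integral in Equation 3.72 can be evaluated in closed form using Equations 3.61 to 3.63 with G = A. The following is a sketch of that calculation, assuming real constants, rather than a result quoted in the text:

$$\int_{-\infty}^{\infty} |\psi(x)|^2 \, dx = 2\int_0^{L/2} D^2\cos^2(kx)\, dx + 2\int_{L/2}^{\infty} A^2 e^{-2\alpha x}\, dx = D^2\left(\frac{L}{2} + \frac{\sin(kL)}{2k}\right) + \frac{A^2 e^{-\alpha L}}{\alpha} = 1.$$

Combining this with $D = Ae^{-\alpha L/2}/\cos(kL/2)$ from Equation 3.66 fixes the value of A (taken to be real and positive). The second term, $A^2 e^{-\alpha L}/\alpha$, is the probability of finding the particle outside the well.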


Figure 3.21 shows the energy eigenvalues and the corresponding eigenfunctions
that result from one choice of m, $V_0$ and L. For the particular parameters chosen,
there are three energy levels below the top of the well, but this number will vary
from well to well. Many of the features of the eigenfunctions are similar to those
of an infinite square well. In the lowest energy level, $E_1$, the eigenfunction $\psi_1(x)$
is even and has no nodes. In the next energy level, $E_2$, the eigenfunction $\psi_2(x)$ is
odd and has one node. The alternating pattern of evenness and oddness continues,
with each successive eigenfunction gaining one extra node. In general, the nth
eigenfunction has (n − 1) nodes and is an even/odd function according to whether
n is odd/even.


Figure 3.21 (a) The energy levels $E_1$, $E_2$ and $E_3$ of a one-dimensional finite
square well; (b) the eigenfunctions $\psi_1$, $\psi_2$ and $\psi_3$ plotted on a single set of axes.


The time-independent probability densities $|\psi_n(x)|^2$ corresponding to these three
energy eigenfunctions are shown in Figure 3.22. The probability density drops to
zero at each node of the eigenfunction, indicating that there is little chance of
finding the particle near these points.


The probability densities for a finite well extend beyond the boundaries of the 
well, showing that a particle of negative energy may be found outside the well 
(the phenomenon of barrier penetration). The particle penetrates further into the
classically forbidden region as its energy increases. For energy levels very near
the top of the well, the particle may even be more likely to be found outside the 
well than inside it. 
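
The trend can be made quantitative with a short Python sketch. It is an illustration under assumptions, not the course package: it uses the dimensionless variables ξ = kL/2 and η = αL/2 (names chosen here), with ξ² + η² = R² and R² = mV₀L²/2ħ², finds the bound states by bisection, and then integrates $|\psi|^2$ analytically inside and outside the well, with lengths measured in units of L/2 and the amplitude inside the well set to 1.

```python
import math


def bound_states(R):
    """Return (xi, eta) for each bound state of a well of dimensionless strength R."""
    states = []
    j = 0
    while j * math.pi / 2.0 < R:
        even = (j % 2 == 0)

        def g(xi):
            eta = math.sqrt(max(R * R - xi * xi, 0.0))
            if even:
                return xi * math.sin(xi) - eta * math.cos(xi)  # even states
            return xi * math.cos(xi) + eta * math.sin(xi)      # odd states

        a = j * math.pi / 2.0
        b = min((j + 1) * math.pi / 2.0, R)
        for _ in range(200):  # bisection on the sign change of g
            mid = 0.5 * (a + b)
            if g(a) * g(mid) <= 0.0:
                b = mid
            else:
                a = mid
        xi = 0.5 * (a + b)
        states.append((xi, math.sqrt(R * R - xi * xi)))
        j += 1
    return states


def probability_outside(xi, eta, even):
    """Fraction of the total probability lying in the classically forbidden region."""
    if even:
        inside = 1.0 + math.sin(2.0 * xi) / (2.0 * xi)  # integral of cos^2 over the well
        edge = math.cos(xi) ** 2                         # |psi|^2 at the walls
    else:
        inside = 1.0 - math.sin(2.0 * xi) / (2.0 * xi)  # integral of sin^2 over the well
        edge = math.sin(xi) ** 2
    outside = edge / eta  # two exponential tails, each contributing edge / (2 eta)
    return outside / (inside + outside)


for n, (xi, eta) in enumerate(bound_states(4.0), start=1):
    p = probability_outside(xi, eta, even=(n % 2 == 1))
    print(f"level {n}: probability outside the well = {p:.3f}")
```

For the assumed value R = 4.0 there are three bound states, and the probability of finding the particle outside the well rises from level to level, as described above.
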


Figure 3.22 The probability densities $|\psi_1(x)|^2$, $|\psi_2(x)|^2$ and $|\psi_3(x)|^2$ for the
case shown in Figure 3.21.


Something like this happens in the deuteron — a proton and a neutron bound 
together by attractive nuclear forces. Of course, this is a three-dimensional 
system, which leads to some differences in detail. Nevertheless, it is possible to 
devise a model for the deuteron involving a finite square well of depth 38.5 MeV 
occupying a spherical region of radius 1.63 × 10⁻¹⁵ m. Using this model, only
one energy level is found below the top of the well. This is the ground state of the 
deuteron; there are no excited states in which the proton and neutron remain 
bound together. In the ground state, the eigenfunction describing the separation of 
the proton and neutron extends far beyond the confines of the well. In fact, the 
root-mean-square separation of the proton and neutron is 4 × 10⁻¹⁵ m, and there
is about a 50% chance of finding the particles in a classically forbidden region. 


So far, we have restricted attention to bound states. These are states in which the 
particle has a vanishingly small probability of being found very far from the well. 
They are described by normalized wave functions and have negative energies, 
below the top of the well. You might wonder what happens for positive energies, 
above the top of the well. It is certainly possible to find energy eigenvalues and 
eigenfunctions in this region. The eigenfunctions oscillate sinusoidally both 
inside and outside the well. However, these eigenfunctions carry on oscillating 
infinitely far from the well, and cannot be normalized. It follows that a single 
energy eigenfunction cannot describe an unbound particle. Later in this book, you 
will see that unbound particles are best described by linear combinations of 
energy eigenfunctions (wave packets).


3.3.3 Computations and generalizations


It should be clear by now that a particle in a one-dimensional finite square well is 
a more difficult system to analyse than a particle in an infinite square well. This is 
why we dealt with the infinite square well first, despite the fact that, from a 
mathematical point of view, the infinite square well is best regarded as a limiting 
case of the finite square well. To make further investigations of particles in boxes, 
you should now use the computer package One-dimensional wells. The package 
should give you insight into the way that parameters, such as m, L and $V_0$,
influence the energy eigenvalues and eigenfunctions. The package will also allow 
you to investigate the solutions that arise for more general one-dimensional wells, 
such as those shown in Figure 3.23. 


Figure 3.23 A more general type of one-dimensional well: (a) an energy 
eigenvalue Es and (b) the corresponding energy eigenfunction 5. 


When using the package, you should bear in mind the following points, which
apply to all wells in one dimension.

• Bound states have discrete energy eigenvalues.


• In one dimension, there is no degeneracy: each energy level corresponds to one
eigenfunction.


• In one dimension, energy eigenfunctions may be taken to be real.


• The ground state eigenfunction has no nodes. When arranged in order of
increasing energy, each successive eigenfunction has one more node than its
predecessor.


• For a symmetric well, each eigenfunction is either an even or an odd function.
When arranged in order of increasing energy, this property of evenness or
oddness alternates from one eigenfunction to the next.


Install and run the computer package One-dimensional wells. Work through
the whole of the package following the on-screen guidance notes.


Summary of Chapter 3 


Section 3.1 A particle of mass m in a one-dimensional infinite square well, with 
walls at x = 0 and x = L, has an infinite set of discrete energy levels 


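$$E_n = \frac{n^2 \pi^2 \hbar^2}{2mL^2} \quad \text{for } n = 1, 2, 3, \ldots,$$


with corresponding normalized energy eigenfunctions


$$\psi_n(x) = \sqrt{\frac{2}{L}}\,\sin\left(\frac{n \pi x}{L}\right) \quad \text{for } 0 \le x \le L,$$

$$\psi_n(x) = 0 \quad \text{outside the well}.$$


The stationary-state wave functions


$$\Psi_n(x, t) = \sqrt{\frac{2}{L}}\,\sin\left(\frac{n \pi x}{L}\right) e^{-iE_n t/\hbar} \quad \text{for } 0 \le x \le L,$$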


are complex standing waves describing states of definite energy, En. 


These solutions are obtained by separating variables in the Schrödinger equation 
and solving the time-independent Schrödinger equation, subject to continuity 
boundary conditions. These conditions require that the energy eigenfunction $\psi(x)$
is continuous everywhere, and that $d\psi/dx$ is continuous at points where the
potential energy function is finite. At the walls of an infinite square well we only
require the continuity of $\psi(x)$, and this leads to energy quantization.


Shifting the walls of the well without altering its width, or assigning a different 
constant potential energy to the bottom of the well, will change the way the 
system is described, but not the way it behaves. Each energy eigenfunction in a 
one-dimensional symmetric well is either even or odd. 
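The Section 3.1 formulas can be checked numerically. The following is a minimal sketch, assuming an electron in a well of width 1 nm (both choices are illustrative and not taken from the text); the function names are hypothetical. It evaluates $E_n$ and confirms by a simple midpoint-rule integration that the eigenfunctions are normalized.

```python
import math

HBAR = 1.054571817e-34            # reduced Planck constant in J s
ELECTRON_MASS = 9.1093837015e-31  # kg; choosing an electron is an assumption
ELECTRONVOLT = 1.602176634e-19    # joules per eV


def energy_level(n, mass, width):
    """Return E_n = n^2 pi^2 hbar^2 / (2 m L^2) in joules."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    return (n * math.pi * HBAR) ** 2 / (2.0 * mass * width ** 2)


def eigenfunction(n, x, width):
    """Return psi_n(x) = sqrt(2/L) sin(n pi x / L) inside the well, 0 outside."""
    if 0.0 <= x <= width:
        return math.sqrt(2.0 / width) * math.sin(n * math.pi * x / width)
    return 0.0


def norm_integral(n, width, steps=2000):
    """Midpoint-rule estimate of the integral of |psi_n|^2 over the well."""
    dx = width / steps
    total = sum(eigenfunction(n, (k + 0.5) * dx, width) ** 2 for k in range(steps))
    return total * dx


if __name__ == "__main__":
    width = 1.0e-9  # hypothetical well width of 1 nm
    for n in (1, 2, 3):
        e_n = energy_level(n, ELECTRON_MASS, width) / ELECTRONVOLT
        print(f"E_{n} = {e_n:.3f} eV, norm = {norm_integral(n, width):.6f}")
```

The energies scale as $n^2$, so under these assumptions the second and third levels lie at four and nine times the ground-state energy.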


Section 3.2 The infinite square well can be generalized to two and three
dimensions. For a three-dimensional infinite square well that occupies a cubic
region of width $L$, the energy eigenvalues are given by

$$E_{n_x, n_y, n_z} = \frac{(n_x^2 + n_y^2 + n_z^2)\, \pi^2 \hbar^2}{2mL^2}.$$

If the well occupies the region $0 \le x \le L$, $0 \le y \le L$ and $0 \le z \le L$, the
corresponding normalized energy eigenfunctions are

$$\psi_{n_x, n_y, n_z}(x, y, z) = \left(\frac{2}{L}\right)^{3/2} \sin\left(\frac{n_x \pi x}{L}\right) \sin\left(\frac{n_y \pi y}{L}\right) \sin\left(\frac{n_z \pi z}{L}\right).$$

A three-dimensional infinite square well can be used to model the energy levels in
F-centres and quantum dots. In two and three dimensions, energy levels may be
degenerate.
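To illustrate degeneracy in the cubic well, the sketch below groups the states $(n_x, n_y, n_z)$ by the value of $n_x^2 + n_y^2 + n_z^2$, which fixes the energy in units of $\pi^2\hbar^2/2mL^2$. The function name and the cut-off `n_max` are hypothetical choices made for this example.

```python
from collections import defaultdict


def cubic_well_degeneracies(n_max):
    """Map n_x^2 + n_y^2 + n_z^2 to the list of (n_x, n_y, n_z) sharing it.

    Energies are in units of pi^2 hbar^2 / (2 m L^2). Only sums up to
    n_max^2 + 2 are returned: any larger sum might also be reached with a
    quantum number above n_max, so its count could be incomplete.
    """
    states = defaultdict(list)
    for nx in range(1, n_max + 1):
        for ny in range(1, n_max + 1):
            for nz in range(1, n_max + 1):
                states[nx * nx + ny * ny + nz * nz].append((nx, ny, nz))
    cutoff = n_max * n_max + 2
    return {total: states[total] for total in sorted(states) if total <= cutoff}


if __name__ == "__main__":
    for total, members in cubic_well_degeneracies(4).items():
        print(f"n^2 sum = {total:2d}: degeneracy {len(members)}  {members}")
```

The ground state $(1, 1, 1)$ is non-degenerate, while the next level is shared by $(2, 1, 1)$, $(1, 2, 1)$ and $(1, 1, 2)$.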


Section 3.3 The time-independent Schrödinger equation for a particle in a 
finite one-dimensional square well can be solved numerically. For negative 
energies (below the top of the well) a finite number of energy eigenvalues and 
eigenfunctions are obtained. Energy quantization arises from the continuity 
boundary conditions, which now require both $\psi(x)$ and $d\psi/dx$ to be continuous
at the walls of the well. The eigenfunctions, and the corresponding stationary-state 
wave functions, extend beyond the walls of the well. This means that there is a 
non-zero probability of finding the particle outside the well — a characteristic 
quantum phenomenon called barrier penetration. This phenomenon is observed 
in the case of a deuteron, whose constituent particles have a root-mean-square 
separation that is about twice the range of the force that binds them together. 


Achievements from Chapter 3 


After studying this chapter you should be able to: 


3.1 Explain the meanings of the newly defined (emboldened) terms and 
symbols, and use them appropriately. 


3.2 Write down the Schrödinger equation and the time-independent Schrödinger 
equation for the infinite and finite square well in one, two or three 
dimensions. 


3.3 Write down and apply the continuity boundary conditions for energy 
eigenfunctions. 


3.4 Verify that given functions are solutions of the Schrödinger equation and the 
time-independent Schrödinger equation (subject to certain boundary
conditions); in the case of the infinite well, determine the corresponding 
energy eigenvalues. 


3.5 Describe some characteristic features of energy eigenfunctions and 
eigenvalues, including energy quantization, degeneracy, and barrier 
penetration. 


After studying the computer package One-dimensional wells, you should 
also be able to: 


3.6 Describe in qualitative terms the generalization of the ideas you have met in 
the context of square wells to wells of a more general shape. 


Chapter 4 The Heisenberg uncertainty principle 


Introduction 


Figure 4.1 Heisenberg graffiti.


The Heisenberg uncertainty principle is one of the most celebrated results of 
quantum mechanics. Many people who have not studied science in any detail have 
heard of this principle, and think of it as expressing the shifty evasiveness of the 
microscopic world, cloaking atoms and subatomic particles in a veil of mystery 
that we are unable to penetrate (Figure 4.1). This is not the scientific view. 

The final section of this chapter will discuss the uncertainty principle, and
you will see that it makes a clear statement about the results of experiments,
with consequences that can be tested and confirmed. In order to reach this
understanding, we must first examine the extent to which predictions can be made
in an unpredictable quantum world.


Chapter 1 emphasized the fact that quantum mechanics is
indeterministic. For example, identical uranium nuclei can
be in identical states and yet decay at very different times. What
is more, identical nuclei in identical states can decay in different
ways, with different products (Figure 4.2). This is not because
of any differences between the nuclei, or their histories, or their
surroundings; rather, it appears to be a fundamental fact about
the world we live in. A given cause does not always produce
the same effect. Yet there is a pattern behind the randomness.
Each uranium nucleus has a definite probability to decay, and
each type of decay in Figure 4.2 is also characterized by a
probability. So physics is not brought to a halt by indeterminacy,
but it is forced to deal directly with chance and probability.
If we are to understand the quantum world, we have no option
but to use probabilities, since this is what Nature appears to do.


The first half of this chapter will show how quantum mechanics
is used to establish the possible outcomes of a measurement,
and their relative probabilities. This is a crucial issue,
as important, in its way, as Schrödinger’s equation. Any
probability distribution can be characterized by two quantities:
its expectation value and its uncertainty. The expectation value is
the mean value you would expect to get in the long run, while the
uncertainty gives the spread of values around this mean value. The second half of
the chapter will show how expectation values and uncertainties are calculated in
quantum mechanics, before explaining how uncertainties appear in the uncertainty
principle. At the end of the chapter you will see how the uncertainty principle
allows us to provide rough estimates for the size and ground state energy of a
hydrogen atom.

Figure 4.2 An example of the unpredictable nature of quantum mechanics. A
nucleus in a definite state can decay in different ways, with different
probabilities. [Decay scheme: unstable 137Cs (E = 1.176 MeV) beta-decays to
metastable 137Ba (E = 0.662 MeV) in 93.5% of decays, which then emits a gamma
ray to reach stable 137Ba (E = 0.000 MeV); in the remaining 6.5% of decays,
137Cs beta-decays directly to stable 137Ba.]


Section 8.4 of the Mathematical toolkit reviews the concept of probability. 
You may like to read it as part of your study of this chapter. 


4.1 Indeterminacy in quantum mechanics 


The great French mathematician Pierre Simon Laplace told Napoleon that he 
would be able to predict the future of the Universe if he knew the present positions 
and velocities of all its particles. Napoleon asked what role God had in this 
Universe, and received the confident reply: ‘I have no need of that hypothesis’. 


Although Laplace greatly exaggerated his powers of calculation, the story does 
emphasize an important point about Newtonian mechanics — it is deterministic. 
In Newtonian mechanics it is possible, in principle, to use exact knowledge about 
the present state of a system to make exact predictions about its future. By 
contrast, quantum mechanics is indeterministic; we can know as much as it is 
possible to know about the state of a system and still be unable to predict what 
will happen if we carry out a measurement on the system. This section will trace 
the origins of quantum indeterminism. You have met many of the ideas in 
previous chapters, so the task is partly one of revision. 


4.1.1 The wave function and Schrödinger’s equation


In wave mechanics, the state of a system is specified by giving its wave function. 
If we know the wave function of a system at a given time, we know as much as it 
is possible to know about the state of the system at that time. All predictions 
about the behaviour of the system are based on knowledge of its wave function. 


The wave function depends on the spatial coordinates of all the particles in the 
system, and also on time. We shall concentrate on the simplest possible case, 
where a single particle moves along the x-axis. In this case, the wave function 
$\Psi(x, t)$ is a function of the position coordinate $x$ and the time $t$. In general, this
function is complex, with both real and imaginary parts. 


The time-evolution of the wave function is given by Schrödinger’s equation,

$$i\hbar \frac{\partial \Psi}{\partial t} = \hat{H}\Psi, \tag{4.1}$$

where $\hat{H}$ is the Hamiltonian operator of the system. For a single particle moving
along the $x$-axis, the Hamiltonian operator is

$$\hat{H} = -\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2} + V(x),$$

where $V(x)$ is the potential energy function, and Schrödinger’s equation takes the
form

$$i\hbar \frac{\partial \Psi}{\partial t} = -\frac{\hbar^2}{2m} \frac{\partial^2 \Psi}{\partial x^2} + V(x)\, \Psi(x, t). \tag{4.2}$$

This partial differential equation is of first order with respect to time.
Consequently, if we know the wave function at all points now, Schrödinger’s
equation tells us what the wave function will be at all points in the future (and
what it was in the past). This time-evolution is entirely predictable, and does not
shed any light on the indeterministic nature of quantum mechanics.


4.1.2 Relating the wave function to measurements 


Indeterminacy arises from the interpretation of the wave function
itself. As an example, consider position measurements. Imagine
placing a bank of small Geiger counters all along the $x$-axis,
as in Figure 4.3. We will assume that each Geiger counter
is 100% efficient, so that it clicks if it contains a charged particle.
Now suppose that a charged particle is in a state described by the
wave function $\Psi(x, t)$, which extends over many Geiger counters.
Then, if all the counters are switched on simultaneously, one
or other of them will click, but we cannot predict which one.
Instead, we have Born’s rule, which states that the probability
of detecting the particle at time $t$ in a small interval of length $\delta x$,
centred on the position $x$, is given by

$$\text{probability} = |\Psi(x, t)|^2 \, \delta x = \Psi^*(x, t)\, \Psi(x, t)\, \delta x.$$

Figure 4.3 Detecting the position of a particle with a bank of Geiger counters.


Figure 4.3 shows both the wave function $\Psi$ and the square of its modulus, $|\Psi|^2$.
Using Born’s rule, and taking appropriate areas under the graph of $|\Psi|^2$, we can
say that, at time $t$, Geiger counter A is about ten times more likely to click than
Geiger counter B, but we cannot say which Geiger counter will actually click. In 
other words, the wave function allows us to predict the possible experimental 
outcomes and their relative probabilities, but it does not allow us to predict exactly 
which outcome will occur in any given instance. 
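Born’s rule can be turned into numbers by integrating $|\Psi|^2$ over the region covered by each counter. The sketch below is only an illustration: it assumes the particle is in the ground state of an infinite square well of unit width (with an arbitrary constant phase), and it divides the well into ten hypothetical counters of equal size. None of these choices comes from Figure 4.3.

```python
import cmath
import math

WELL_WIDTH = 1.0  # hypothetical units


def wave_function(x, phase=0.7):
    """Assumed state: infinite-well ground state times an arbitrary phase factor."""
    if not 0.0 <= x <= WELL_WIDTH:
        return 0j
    amplitude = math.sqrt(2.0 / WELL_WIDTH) * math.sin(math.pi * x / WELL_WIDTH)
    return amplitude * cmath.exp(-1j * phase)


def detection_probability(a, b, steps=2000):
    """Midpoint-rule integral of Psi* Psi from a to b (Born's rule)."""
    dx = (b - a) / steps
    total = 0.0
    for k in range(steps):
        psi = wave_function(a + (k + 0.5) * dx)
        total += (psi.conjugate() * psi).real * dx
    return total


# Ten adjacent "Geiger counters", each covering one tenth of the well.
counter_edges = [i * WELL_WIDTH / 10 for i in range(11)]
click_probabilities = [
    detection_probability(counter_edges[i], counter_edges[i + 1]) for i in range(10)
]

if __name__ == "__main__":
    for i, p in enumerate(click_probabilities):
        print(f"counter {i}: probability {p:.4f}")
    print("total:", round(sum(click_probabilities), 6))
```

The phase factor drops out of $\Psi^*\Psi$, and the probabilities for all the counters add up to 1, as they must if one counter is certain to click.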


It is also worth considering the situation immediately after the particle has been 
detected. If Geiger counter C has clicked, we can be sure that the particle is 
somewhere in the vicinity of this Geiger counter — the particle cannot have 
strayed too far because it travels at a finite speed (certainly less than the speed 
of light). Since the square of the modulus of the wave function tells us the 
probability of detecting the particle in various regions, the wave function 
describing the particle immediately after the click must be strongly localized 
around the Geiger counter that happened to click (counter C in this case). So the 
wave function immediately after the click is drastically different from the wave 
function before the click. This phenomenon is called the collapse of the wave 
function. We cannot predict exactly how the wave function will collapse. If 
Geiger counter A had clicked, the wave function immediately after the click 
would have been strongly localized around counter A. However, it is clear that the 
act of measurement drastically affects the wave function; the state of the system 
after the measurement is not the same as the state before the measurement. 


This has an important consequence for experiments designed to test the 
predictions of quantum physics. If we take a single system and perform a 
sequence of measurements on it, the wave function will change unpredictably with 
each measurement, making comparison between theory and experiment difficult. 


Figure 4.4 Energy eigenvalues and eigenfunctions for a particle in a
one-dimensional infinite square well.


It is generally better to start with a large number of identical systems all in the 
same state — for example, hydrogen atoms all in the ground state — and perform 
the same measurement of each system. Although each system is in exactly the 
same state, a spread of results is typically obtained. For example, we could take 
measurements of position at time $t$ in the state described by the wave function
$\Psi(x, t)$ and plot the results as a histogram. In the long run of many measurements,
this histogram is expected to follow the shape of the graph of $|\Psi(x, t)|^2$.
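The way a histogram of results approaches $|\Psi|^2$ can be imitated on a computer. The following sketch assumes, purely for illustration, that every system is in the ground state of an infinite square well of unit width; it draws one simulated position per system by rejection sampling and compares the histogram with the exact areas under $|\Psi|^2$. The function names and the random seed are hypothetical.

```python
import math
import random

WELL_WIDTH = 1.0  # hypothetical units


def probability_density(x):
    """|Psi|^2 for the assumed state: (2/L) sin^2(pi x / L)."""
    return (2.0 / WELL_WIDTH) * math.sin(math.pi * x / WELL_WIDTH) ** 2


def measure_positions(n_systems, seed=1):
    """Simulate one position measurement on each of n_systems identical systems."""
    rng = random.Random(seed)
    peak = 2.0 / WELL_WIDTH  # maximum value of the density
    results = []
    while len(results) < n_systems:
        x = rng.uniform(0.0, WELL_WIDTH)
        # Rejection sampling: keep x with probability density(x) / peak.
        if rng.uniform(0.0, peak) < probability_density(x):
            results.append(x)
    return results


def histogram(samples, n_bins):
    """Fraction of samples falling in each of n_bins equal bins."""
    counts = [0] * n_bins
    for x in samples:
        index = min(int(x / WELL_WIDTH * n_bins), n_bins - 1)
        counts[index] += 1
    return [c / len(samples) for c in counts]


def exact_bin_probability(a, b):
    """Exact integral of (2/L) sin^2(pi x / L) from a to b."""
    def antiderivative(x):
        return x / WELL_WIDTH - math.sin(2.0 * math.pi * x / WELL_WIDTH) / (2.0 * math.pi)
    return antiderivative(b) - antiderivative(a)


if __name__ == "__main__":
    fractions = histogram(measure_positions(20000), 10)
    for i, f in enumerate(fractions):
        exact = exact_bin_probability(i * 0.1, (i + 1) * 0.1)
        print(f"bin {i}: simulated {f:.3f}, exact {exact:.3f}")
```

Individual results are scattered unpredictably, but the bin fractions settle close to the predicted probabilities as the number of systems grows.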


It is straightforward to use a wave function to find the probability distribution for 
position. However, quantum mechanics assigns a much greater role to wave 
functions than this. In wave mechanics, the wave function gives a complete 
description of the state of a system, and therefore contains information about the 
probability distributions of other quantities, such as energy and momentum. This 
information is contained in the shape of the wave function. One of the main aims 
of this chapter is to explain how this information can be extracted. 


4.1.3 Stationary states and energy values 
Schrödinger’s equation has a set of special solutions called stationary states in
which the wave function takes the form

$$\Psi_n(x, t) = \psi_n(x)\, e^{-iE_n t/\hbar}, \tag{4.3}$$

where $\psi_n(x)$ satisfies the time-independent Schrödinger equation

$$\hat{H}\psi_n(x) = E_n \psi_n(x). \tag{4.4}$$

We generally find that this equation gives acceptable solutions only for certain
values of $E_n$, or when $E_n$ lies within certain ranges. The special values of $E_n$ that
allow a solution to be found are called energy eigenvalues, and the corresponding
functions $\psi_n(x)$ are called energy eigenfunctions. Figure 4.4 shows some of the
energy eigenvalues and eigenfunctions for a particle in a one-dimensional infinite
square well.


The energy eigenvalues are the allowed energies of the system. No matter what 
state the system might be in, if we measure its energy, we always get one or other 
of the energy eigenvalues. The stationary states are special because they are states 
of definite energy. If the system is described by a stationary-state wave function 
$\Psi_n(x, t)$, and we measure its energy, we get the corresponding energy eigenvalue
$E_n$ with certainty. So not everything in quantum mechanics is unpredictable;
sometimes it is possible to predict the result of a measurement with absolute 
certainty. 


Exercise 4.1 An isolated system is described by the stationary-state wave
function $\Psi_n(x, t) = \psi_n(x)\, e^{-iE_n t/\hbar}$. Show that, at any fixed time $t$, $\Psi_n(x, t)$ is an
energy eigenfunction with energy eigenvalue $E_n$. What value of energy will be
found in this stationary state? Does the time of the measurement make any
difference to your answer?


Exercise 4.2 A particle of mass $m$ is in a symmetrical infinite
one-dimensional square well of width $L$. What are the chances of measuring
an energy $10\pi^2\hbar^2/2mL^2$?


4.1.4 Linear combinations of stationary states 


Simple treatments of quantum mechanics sometimes leave the impression that a 
system is always in one or other of its allowed energy levels, and that it passes 
instantaneously from one level to another. For example, in describing the emission 
of light by an atom, we often talk of the atom as being in an excited energy level, 
prior to emitting a photon and jumping down to a lower energy level. There is 
nothing wrong with this description, but it is important to realize that other types 
of state can exist. 


It is perfectly possible for a system to be in a linear combination of stationary 
states, described by a wave function 


$$\Psi(x, t) = a_1 \Psi_1(x, t) + a_2 \Psi_2(x, t) + \cdots, \tag{4.5}$$


where the $\Psi_i(x, t)$ are normalized stationary-state wave functions and the
coefficients $a_i$ are complex constants, scaled suitably to ensure that the wave
function $\Psi(x, t)$ is normalized.


Exercise 4.3 Show that the wave function in Equation 4.5 satisfies
Schrödinger’s equation.


The linear combination specified by Equation 4.5 is not a product of separate
functions of $x$ and $t$, so it is not a stationary state. However, this function does
satisfy Schrödinger’s equation, so there is no reason to reject it as a possible wave
function of the system. It turns out that states like this are not just theoretical
curiosities; it is perfectly possible to prepare them in the laboratory. For example,
an atom in its ground state can be excited into a linear combination of two
closely-spaced excited states by exposing it to coherent laser light with a range of
photon energies that spans the energy differences between the ground state
and both of the excited states. Such linear combinations may eventually
become of great practical and economic interest, as they are at the heart of the
rapidly-developing subject of quantum computing. For both theoretical and
practical reasons, we are therefore obliged to take the linear combination wave
function of Equation 4.5 seriously. But how should we interpret it?


It is natural to ask what the energy of the system is in such a state. You might 
imagine that it would be some weighted sum of the energy eigenvalues. For 
example, in the state 


$$\Psi(x, t) = a_1 \Psi_1(x, t) + a_2 \Psi_2(x, t), \tag{4.6}$$


you might guess that the energy would lie somewhere between $E_1$ and $E_2$,
presumably closer to $E_1$ when $|a_1| \gg |a_2|$, and closer to $E_2$ when $|a_2| \gg |a_1|$,
with a smooth transition in between. But this would be totally wrong. We know
that every energy measurement yields one or other of the energy eigenvalues, so it
makes no sense to say that the system has an energy between two neighbouring
eigenvalues $E_1$ and $E_2$.


In fact, we cannot say that the state in Equation 4.6 has any definite energy at all. 
In this state, every measurement of energy yields either $E_1$ or $E_2$, but these
results appear randomly, and it is impossible to predict which one of them will 
occur in any single measurement. Nevertheless, a definite pattern emerges. Each 
result has a definite probability, and it is these probabilities that can be predicted 
by quantum mechanics. 
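The behaviour described above can be imitated with a short simulation. In this sketch the two energies and their probabilities are simply assumed (they are hypothetical inputs, not derived from any wave function); the point is only that each simulated measurement returns $E_1$ or $E_2$, never a value in between, while the relative frequencies settle down to the assumed probabilities.

```python
import random


def simulate_energy_measurements(energies, probabilities, n_systems, seed=0):
    """Draw one energy outcome for each of n_systems identically prepared systems."""
    rng = random.Random(seed)
    return rng.choices(energies, weights=probabilities, k=n_systems)


E1, E2 = 1.0, 4.0                     # arbitrary units, chosen for illustration
ASSUMED_PROBABILITIES = (0.36, 0.64)  # hypothetical values, not derived here

outcomes = simulate_energy_measurements((E1, E2), ASSUMED_PROBABILITIES, 20000)
fraction_e1 = outcomes.count(E1) / len(outcomes)
mean_energy = sum(outcomes) / len(outcomes)

if __name__ == "__main__":
    print("distinct outcomes:", sorted(set(outcomes)))
    print("fraction giving E1:", round(fraction_e1, 3))
    print("mean of all results:", round(mean_energy, 3))
```

The mean of many results lies between $E_1$ and $E_2$, but no single measurement ever yields that mean value.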


Given the indeterministic nature of the quantum world, the best we can do is to 
ask two questions: 


1. In a given system, what are the possible values of energy?
2. In a given state, what are the probabilities of the various possible energies?


The first question has already been answered. The possible energy values are the 
energy eigenvalues, obtained by solving the time-independent Schrödinger 
equation for the system under consideration. We now turn to the second question. 
The answer to this question is of major importance, as it is here that the 
predictions of quantum mechanics make direct contact with experiment. 


4.2 Probability distributions 


Quantum mechanics is indeterministic; it tells us what can happen rather than 
what will happen. Fortunately, it also tells us the probabilities of the various 
possible outcomes. This section will introduce the basic law that determines these 
probabilities. 


To take a definite case, we consider measurements of energy in a system 
consisting of a single particle in a one-dimensional infinite potential energy well. 
This could be a square well (as in Figure 4.4) or an infinite well with walls of 
some other shape. In such a well, the energy eigenvalues are all discrete, which 
means that they occupy a set of isolated values rather than a continuum. The 
energy eigenvalues are also non-degenerate, which means that each energy 
eigenvalue corresponds to a unique energy eigenfunction (ignoring an arbitrary 
choice of phase factor, which has no physical significance). It follows that each 
energy eigenvalue, and its corresponding eigenfunction, can be labelled by an 
integer n = 1,2,3,.... 




4.2.1 The overlap rule 


If we measure the energy of the system, we are bound to get one of its energy
eigenvalues $E_1, E_2, \ldots$, but how likely is each value? The answer is given by
the following rule:


The overlap rule


If the energy of a system is measured at time $t$, the probability of obtaining
the $i$th energy eigenvalue $E_i$ is given by

$$p_i = \left| \int_{-\infty}^{\infty} \psi_i^*(x)\, \Psi(x, t)\, dx \right|^2, \tag{4.7}$$

where $\Psi(x, t)$ is the wave function of the system and $\psi_i(x)$ is the energy
eigenfunction corresponding to eigenvalue $E_i$. Both the wave function and
the eigenfunction are assumed to be normalized.


The integral on the right-hand side of Equation 4.7 will be called the
overlap integral of $\psi_i(x)$ and $\Psi(x, t)$.


Let us assume, for the moment, that $\psi_i(x)$ and $\Psi(x, t)$ are both real functions.
This will help us to give a graphical interpretation of the overlap rule. Figure 4.5a
shows a case in which the energy eigenfunction $\psi_1(x)$, corresponding to the
eigenvalue $E_1$, does not overlap with the wave function $\Psi(x, t)$ at all. In this case,
the overlap integral is equal to zero, so there is no chance of obtaining the value
$E_1$ in an energy measurement. In Figure 4.5b, there is a small overlap between the
energy eigenfunction and the wave function, leading to a small probability of
obtaining the value $E_1$, while in Figure 4.5c, there is a much larger overlap,
leading to a much larger probability of obtaining the value $E_1$ in an energy
measurement. The probability of obtaining a particular energy eigenvalue is given
by the square of the modulus of the corresponding overlap integral. In the example
shown in Figure 4.5d, the overlap integral is negative, but this does not matter; the
probability of getting the eigenvalue $E_1$ is just as high as in Figure 4.5c.

Figure 4.5 The overlap between an energy eigenfunction $\psi_1(x)$ and a wave
function $\Psi(x, t)$: (a) no overlap; (b) a small amount of overlap;
(c) and (d) large amounts of overlap.

It is important to note that cancellations can occur in an overlap integral.
Figure 4.6 shows a case where a positive contribution to the overlap integral from
$x > 0$ is exactly cancelled by a negative contribution from $x < 0$. The overlap
integral in this case is equal to zero, so the probability of obtaining the eigenvalue
$E_1$ is equal to zero. In general, two functions with a vanishing overlap integral are
said to be orthogonal to one another. The word ‘orthogonal’ is borrowed from
geometry, where it means ‘perpendicular to’. In the present context, we can think
of the square of the modulus of the overlap integral of two normalized functions
as being a measure of their similarity; this quantity is equal to 1 for two identical
normalized functions, and it is equal to zero for two orthogonal functions. So
orthogonal functions are thought of as being completely dissimilar.
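The overlap rule lends itself to a direct numerical check. The sketch below assumes an infinite square well of unit width with walls at $x = 0$ and $x = L$, and a hypothetical normalized wave function $\Psi(x) = \sqrt{30/L^5}\, x(L - x)$ at the instant of measurement; neither choice is taken from Figures 4.5 or 4.6. It evaluates the overlap integral with the first two energy eigenfunctions by the midpoint rule.

```python
import math

WELL_WIDTH = 1.0  # hypothetical units


def eigenfunction(n, x):
    """Infinite-well energy eigenfunction sqrt(2/L) sin(n pi x / L)."""
    return math.sqrt(2.0 / WELL_WIDTH) * math.sin(n * math.pi * x / WELL_WIDTH)


def trial_wave_function(x):
    """Assumed normalized state: sqrt(30/L^5) x (L - x)."""
    return math.sqrt(30.0 / WELL_WIDTH ** 5) * x * (WELL_WIDTH - x)


def overlap_integral(f, g, steps=4000):
    """Midpoint-rule estimate of the integral of conj(f(x)) g(x) over the well."""
    dx = WELL_WIDTH / steps
    total = 0j
    for k in range(steps):
        x = (k + 0.5) * dx
        total += complex(f(x)).conjugate() * g(x) * dx
    return total


def energy_probability(n):
    """Overlap rule: square of the modulus of the overlap integral."""
    amplitude = overlap_integral(lambda x: eigenfunction(n, x), trial_wave_function)
    return abs(amplitude) ** 2


if __name__ == "__main__":
    print("p_1 =", round(energy_probability(1), 6))
    print("p_2 =", energy_probability(2))
```

For this assumed state the overlap with $\psi_1$ is large, so $p_1$ is close to 1 (its exact value is $960/\pi^6$), whereas the contributions to the overlap with $\psi_2$ from the two halves of the well cancel, giving $p_2 = 0$ and showing that the two functions are orthogonal.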


In order to draw graphs, we have restricted attention to functions with real values. 
In general, both the wave function and the energy eigenfunctions may have 
complex values, so the graphs in Figures 4.5 and 4.6 should be regarded as 
emblems of something more general. 


Apart from the difficulties of drawing graphs, there are no special difficulties in 
handling complex wave functions or eigenfunctions. We can still calculate the 
appropriate overlap integral, and then take the square of its modulus to find the 
probability of a given energy value. It is important to note that Equation 4.7 
involves $\psi_i^*(x)$, the complex conjugate of the $i$th energy eigenfunction. In many
cases, the energy eigenfunctions can be chosen to be real, so complex conjugation 
may seem unnecessary. However, you will eventually meet energy eigenfunctions 
that are complex, so it is best to include complex conjugation from the outset, 
since it is needed in the general case. 


You may recall from Chapter 1 that a probability amplitude is something whose 
modulus squared is equal to a probability. The overlap rule can therefore be 
interpreted as telling us that the probability amplitude for measuring the energy 
eigenvalue $E_i$ is given by

$$\text{probability amplitude} = \int_{-\infty}^{\infty} \psi_i^*(x)\, \Psi(x, t)\, dx. \tag{4.8}$$


Hence, in wave mechanics, a probability amplitude can be found by evaluating an 
overlap integral. The probability is found by taking the square of the modulus of 
the probability amplitude. Do not confuse probabilities and probability 
amplitudes. While the probability amplitude may be a negative or complex 
number, the probability is always real and non-negative.


The overlap rule applied to a stationary state 


The overlap rule is a fundamental principle of quantum mechanics, as important
in its own way as Schrödinger’s equation. We will now explore some of its
consequences. We begin with the simplest case, a stationary state. Suppose that
a system is described by the stationary-state wave function

$$\Psi_i(x, t) = \psi_i(x)\, e^{-iE_i t/\hbar},$$

corresponding to the energy eigenfunction $\psi_i(x)$ and energy eigenvalue $E_i$. We
know that any measurement of the energy in this state is bound to give the
value $E_i$. Let’s see how this fact emerges naturally from the overlap rule.

For simplicity, let us suppose that the energy measurement is made at time $t = 0$,
when the wave function is $\Psi_i(x, 0) = \psi_i(x)\, e^0 = \psi_i(x)$. Then the overlap rule
tells us that the probability of obtaining the value $E_i$ is

$$p_i = \left| \int_{-\infty}^{\infty} \psi_i^*(x)\, \psi_i(x)\, dx \right|^2 = 1,$$

where the last step follows because the eigenfunctions are assumed to be
normalized. The fact that this probability is equal to 1 confirms that the eigenvalue
$E_i$ is certain, as expected.


If we accept the overlap rule as a general principle of quantum mechanics, another
far-reaching conclusion emerges. It is a simple matter of logic to note that if the
value $E_i$ is certain, then any other value $E_j \neq E_i$ is impossible. The overlap rule
therefore requires that

$$p_j = \left| \int_{-\infty}^{\infty} \psi_j^*(x)\, \psi_i(x)\, dx \right|^2 = 0 \quad \text{for } E_j \neq E_i.$$

Bearing in mind that the energy eigenvalues are non-degenerate, the condition
$E_j \neq E_i$ is equivalent to saying that $j \neq i$. Moreover, the equation $|z|^2 = 0$ can
be satisfied only if $z = 0$. We therefore conclude that

$$\int_{-\infty}^{\infty} \psi_j^*(x)\, \psi_i(x)\, dx = 0 \quad \text{for } j \neq i. \tag{4.9}$$


Using the definition of orthogonal functions given earlier, this means that: 


Any two energy eigenfunctions with different energy eigenvalues are 
orthogonal to one another. 


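Combining the two words orthogonal and normalized, the set of energy
eigenfunctions is said to be orthonormal, which means that

$$\int_{-\infty}^{\infty} \psi_j^*(x)\, \psi_i(x)\, dx = \begin{cases} 1 & \text{for } j = i, \\ 0 & \text{for } j \neq i. \end{cases} \tag{4.10}$$

It is helpful to introduce a special symbol for the right-hand side of this equation.
The Kronecker delta symbol $\delta_{ij}$ is defined by

$$\delta_{ij} = \begin{cases} 1 & \text{for } i = j, \\ 0 & \text{for } i \neq j. \end{cases} \tag{4.11}$$

The orthonormality of energy eigenfunctions is then expressed compactly as

$$\int_{-\infty}^{\infty} \psi_i^*(x)\, \psi_j(x)\, dx = \delta_{ij}. \tag{4.12}$$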


We have been led to the orthonormality of energy eigenfunctions by considering
implications of the overlap rule, but the energy eigenfunctions are solutions of the
time-independent Schrödinger equation, so there must be something special about
the time-independent Schrödinger equation, guaranteeing that its normalized
solutions obey Equation 4.12. Later in the course you will see that this is indeed
the case. For the moment, we shall simply accept that Equation 4.12 is valid; the
following exercise asks you to verify it in one special case.


Exercise 4.4 A one-dimensional infinite square well has its walls at $x = 0$ and
$x = L$. The normalized energy eigenfunctions for this well are given by

$$\psi_n(x) = \sqrt{\frac{2}{L}} \sin\left(\frac{n\pi x}{L}\right) \quad \text{for } 0 \le x \le L \tag{Eqn 3.22}$$

$$\psi_n(x) = 0 \quad \text{for } x < 0 \text{ and } x > L, \tag{Eqn 3.23}$$

where $n = 1, 2, 3, \ldots$. Verify that these eigenfunctions are orthonormal.
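The exercise calls for an analytical verification, but Equation 4.12 can also be checked numerically as a sanity test. The sketch below, which assumes a well of unit width and uses hypothetical function names, tabulates the overlap integrals of the first four eigenfunctions and compares them with the Kronecker delta.

```python
import math

WELL_WIDTH = 1.0  # hypothetical units


def eigenfunction(n, x):
    """psi_n(x) = sqrt(2/L) sin(n pi x / L) inside the well."""
    return math.sqrt(2.0 / WELL_WIDTH) * math.sin(n * math.pi * x / WELL_WIDTH)


def overlap(i, j, steps=1000):
    """Midpoint-rule estimate of the integral of psi_i(x) psi_j(x) over the well.

    These eigenfunctions are real, so no complex conjugation is needed here.
    """
    dx = WELL_WIDTH / steps
    return sum(
        eigenfunction(i, (k + 0.5) * dx) * eigenfunction(j, (k + 0.5) * dx)
        for k in range(steps)
    ) * dx


def kronecker_delta(i, j):
    return 1 if i == j else 0


def max_deviation(n_max):
    """Largest difference between an overlap integral and delta_ij."""
    return max(
        abs(overlap(i, j) - kronecker_delta(i, j))
        for i in range(1, n_max + 1)
        for j in range(1, n_max + 1)
    )


if __name__ == "__main__":
    for i in range(1, 5):
        print([round(overlap(i, j), 6) for j in range(1, 5)])
```

The table of overlaps is, to numerical accuracy, the identity matrix: 1 on the diagonal and 0 elsewhere.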


The overlap rule for a linear combination of stationary states 


The overlap rule is rather dull for stationary states because we already know that a 
measurement of energy in a stationary state is certain to give the corresponding 
energy eigenvalue. We now turn to the more interesting case of a linear 
combination of stationary states. Let us suppose that the system is described by 
the wave function 


$$\Psi(x, t) = a_1 \Psi_1(x, t) + a_2 \Psi_2(x, t) + \cdots, \tag{4.13}$$


where the $\Psi_i(x, t)$ are stationary-state wave functions and the coefficients $a_i$ are
constants. The stationary-state wave functions are of the form

$$\Psi_i(x, t) = \psi_i(x)\, e^{-iE_i t/\hbar},$$

where $\psi_i(x)$ is an energy eigenfunction. So, if we introduce the notation

$$c_i(t) = a_i\, e^{-iE_i t/\hbar}, \tag{4.14}$$

the wave function in Equation 4.13 can be expressed as a linear combination of
energy eigenfunctions:

$$\Psi(x, t) = c_1(t)\, \psi_1(x) + c_2(t)\, \psi_2(x) + \cdots. \tag{4.15}$$


Wave functions of this type describe states of indefinite energy. When the energy 
of the system is measured, a variety of different results may be obtained, each with 
its own probability. The overlap rule allows us to calculate these probabilities. 


To see how this works, let’s form the overlap integral of $\psi_1(x)$ with both sides of
Equation 4.15. This gives

$$\int_{-\infty}^{\infty} \psi_1^*(x)\, \Psi(x, t)\, dx = c_1(t) \int_{-\infty}^{\infty} \psi_1^*(x)\, \psi_1(x)\, dx + c_2(t) \int_{-\infty}^{\infty} \psi_1^*(x)\, \psi_2(x)\, dx + \cdots,$$

where factors that are independent of $x$ have been taken outside the integrals. By
orthogonality, only the first of the integrals on the right-hand side is non-zero, and
this is equal to 1 because the energy eigenfunctions are normalized. We therefore
have

$$\int_{-\infty}^{\infty} \psi_1^*(x)\, \Psi(x, t)\, dx = c_1(t).$$

The overlap integral on the left is the probability amplitude for getting the value
$E_1$ in an energy measurement carried out at time $t$. Taking the square of the
modulus of this probability amplitude, the corresponding probability is

$$p_1 = \left| \int_{-\infty}^{\infty} \psi_1^*(x)\, \Psi(x, t)\, dx \right|^2 = |c_1(t)|^2.$$

A similar argument applies to any other energy eigenvalue, so we are led to the
following rule.


The coefficient rule


If the wave function $\Psi(x, t)$ of a system is expressed as a linear combination
of energy eigenfunctions, the probability of obtaining the $i$th eigenvalue $E_i$
is

$$p_i = |c_i(t)|^2, \tag{4.16}$$

where $c_i(t)$ is the coefficient of the $i$th energy eigenfunction in the wave
function at the instant of measurement. The coefficient $c_i(t)$ can therefore be
interpreted as a probability amplitude whose modulus squared gives the
probability of measuring the value $E_i$.

We assume that the wave function and the eigenfunctions are normalized.


For example, suppose that the wave function at the time of measurement is

$$\Psi(x, t_0) = 0.6i\, \psi_1(x) - 0.8\, \psi_2(x),$$

where $\psi_1(x)$ and $\psi_2(x)$ are normalized energy eigenfunctions with eigenvalues
$E_1$ and $E_2$. Then the probabilities of getting energies $E_1$ and $E_2$ are

$$p_1 = |0.6i|^2 = 0.36 \quad \text{and} \quad p_2 = |{-0.8}|^2 = 0.64,$$

while the probability of getting any other eigenvalue (such as $E_3$ or $E_4$) is zero
because the corresponding eigenfunctions are absent from the wave function, and
therefore have zero coefficients.
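The arithmetic of this example is easily reproduced with complex numbers. The sketch below is a minimal illustration of the coefficient rule; the function name and the dictionary layout are hypothetical, and the check that the probabilities sum to 1 reflects the assumption that the wave function is normalized.

```python
def energy_probabilities(coefficients, tolerance=1e-9):
    """Apply the coefficient rule p_i = |c_i|^2 to a dict {i: c_i}.

    Raises ValueError if the probabilities do not sum to 1, which would mean
    that the wave function was not normalized.
    """
    probabilities = {i: abs(c) ** 2 for i, c in coefficients.items()}
    if abs(sum(probabilities.values()) - 1.0) > tolerance:
        raise ValueError("coefficients do not describe a normalized wave function")
    return probabilities


# Coefficients of psi_1 and psi_2 in the example wave function.
example_coefficients = {1: 0.6j, 2: -0.8}
example_probabilities = energy_probabilities(example_coefficients)

if __name__ == "__main__":
    for i, p in sorted(example_probabilities.items()):
        print(f"p_{i} = {p:.2f}")
    # Eigenfunctions absent from the wave function have zero coefficient.
    print("p_3 =", example_probabilities.get(3, 0.0))
```

Note that the factor $i$ and the minus sign affect the probability amplitudes but not the probabilities, since only the modulus of each coefficient matters.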


It might appear that the probability given by Equation 4.16 is time-dependent, but
this is not the case. Because the coefficients $c_i(t)$ take the special form given in
Equation 4.14, we have

$$p_i = |c_i(t)|^2 = \left| a_i\, e^{-iE_i t/\hbar} \right|^2 = |a_i|^2 \left| e^{-iE_i t/\hbar} \right|^2 = |a_i|^2, \tag{4.17}$$


which is clearly independent of time. In classical mechanics, the energy of any 
isolated system remains constant in time; this is the law of conservation of 
energy. In quantum mechanics, the system may have an indefinite energy, so it is 
generally meaningless to say that the energy of a system remains constant in time. 


However, we can say that an isolated system has a probability distribution of
energy that remains constant in time; this is the quantum-mechanical version of
energy conservation. (The probability distribution of energy may change when a
measurement is made, but any measurement involves interaction with a measuring
instrument, so the system is no longer isolated.)
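As a rough numerical illustration of this time-independence, the sketch below evaluates $|c_i(t)|^2$ at several times for coefficients of the form $c_i(t) = a_i e^{-iE_i t/\hbar}$. The constants `a`, the energies, and the choice of units with $\hbar = 1$ are assumptions made purely for illustration.

```python
import cmath

HBAR = 1.0  # assumption: units chosen so that hbar = 1

# Hypothetical constants a_i and energies E_i, for illustration only.
a = [0.6j, -0.8]
energies = [1.0, 4.0]


def energy_probabilities(t):
    """Return [|c_i(t)|^2] with c_i(t) = a_i exp(-i E_i t / hbar)."""
    return [abs(ai * cmath.exp(-1j * Ei * t / HBAR)) ** 2
            for ai, Ei in zip(a, energies)]


for t in (0.0, 0.5, 2.0):
    print(t, [round(p, 12) for p in energy_probabilities(t)])
```

The phase factors have unit modulus, so the printed probabilities are the same at every time.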


The coefficient rule can also be derived in a more concise way, using the 
Kronecker delta symbol. It is worth going through this alternative derivation 
carefully because it introduces some useful skills. The starting point is to express 
Equation 4.15 more compactly, using a summation sign: 


$$\Psi(x,t) = \sum_j c_j(t)\, \psi_j(x).$$

Taking the overlap integral of both sides with the $i$th energy eigenfunction then
gives

$$\int_{-\infty}^{\infty} \psi_i^*(x)\, \Psi(x,t)\, dx = \sum_j c_j(t) \int_{-\infty}^{\infty} \psi_i^*(x)\, \psi_j(x)\, dx = \sum_j c_j(t)\, \delta_{ij},$$


because the eigenfunctions are orthonormal (Equation 4.12). Although the sum on
the right-hand side contains many terms, nearly all of these terms are equal to zero
because the Kronecker delta symbol vanishes whenever $j \neq i$. The only surviving
term is that for which $j = i$, and in this case the Kronecker delta symbol is equal
to 1. We therefore conclude that

$$\int_{-\infty}^{\infty} \psi_i^*(x)\, \Psi(x,t)\, dx = c_i(t).$$


Finally, taking the square of the modulus of both sides, and using the overlap rule,
we obtain $p_i = |c_i(t)|^2$, as before.


When using the coefficient rule, it is important to remember that both the wave 
function and the energy eigenfunctions must be normalized. This imposes a 
constraint on the coefficients, which can be established using the Kronecker delta 
technique. We write 


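$$\Psi^*(x,t)\, \Psi(x,t) = \left( \sum_i c_i(t)\, \psi_i(x) \right)^* \left( \sum_j c_j(t)\, \psi_j(x) \right) = \sum_i \sum_j c_i^*(t)\, c_j(t)\, \psi_i^*(x)\, \psi_j(x). \tag{4.18}$$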


Note very carefully that we have chosen different indices, $i$ and $j$, in the two sums
in this product. This is essential. If we had used the same index in both sums, we
would incorrectly omit terms such as $c_1^*(t)\, c_2(t)\, \psi_1^*(x)\, \psi_2(x)$. The index used in
any single sum is arbitrary, but you should always avoid using the same index in
two sums that are multiplied together.


Integrating Equation 4.18 over all x and using orthonormality of the energy 
eigenfunctions (Equation 4.12), we then obtain 


$$\int_{-\infty}^{\infty} \Psi^*(x,t)\, \Psi(x,t)\, dx = \sum_i \sum_j c_i^*(t)\, c_j(t)\, \delta_{ij}.$$


Most of the terms in the sum over $j$ on the right-hand side are equal to zero.
The only non-zero terms are those with $j = i$, for which the Kronecker delta
symbol is equal to 1, so the double sum reduces to the single sum $\sum_i c_i^*(t)\, c_i(t)$.
Normalization of the wave function therefore requires that

$$\sum_i |c_i(t)|^2 = 1.$$


This makes good sense because the probability of getting the $i$th energy
eigenvalue is $p_i = |c_i(t)|^2$, and the sum of the $p_i$, taken over all the energy
eigenvalues, is the probability of obtaining one or other of the eigenvalues.
This must be equal to 1 because we are bound to get one or other of these
allowed energies. Because $|c_i(t)|^2$ is independent of time, a wave function that is
normalized at a given time remains normalized forever.
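The Kronecker delta bookkeeping used above can be mimicked directly in code. The sketch below is illustrative only: the coefficient list `raw` is hypothetical, and the function names are chosen here. It evaluates the double sum $\sum_i \sum_j c_i^* c_j \delta_{ij}$ with two different loop indices and compares it with the single sum $\sum_i |c_i|^2$.

```python
def kronecker_delta(i, j):
    """Return 1 if the two indices are equal, and 0 otherwise."""
    return 1 if i == j else 0


def norm_from_double_sum(c):
    """Evaluate sum_i sum_j c_i^* c_j delta_ij using two separate indices."""
    total = 0
    for i in range(len(c)):
        for j in range(len(c)):  # a different index for the second sum
            total += c[i].conjugate() * c[j] * kronecker_delta(i, j)
    return total


def normalize(c):
    """Rescale the coefficients so that sum_i |c_i|^2 = 1."""
    norm = sum(abs(ci) ** 2 for ci in c) ** 0.5
    return [ci / norm for ci in c]


raw = [1 + 0j, 2j, -2 + 0j]  # hypothetical unnormalized coefficients
c = normalize(raw)
double_sum = norm_from_double_sum(c)
single_sum = sum(abs(ci) ** 2 for ci in c)

print(abs(double_sum), single_sum)
```

Only the terms with equal indices survive, so the two sums agree, and both equal 1 once the coefficients have been normalized.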


Exercise 4.5 A wave function takes the form 
P(x, t) = A(2 u(x) e iEit/h 3 o9(x) en), 


where $A$ is a constant and $\psi_1(x)$ and $\psi_2(x)$ are normalized energy eigenfunctions
with eigenvalues $E_1$ and $E_2$. Find a constant $A$ that normalizes this wave
function. What are the probabilities of finding the values $E_1$ and $E_2$ in an energy
measurement taken on this state?


The overlap rule in the general case 


If the wave function is presented as a sum of energy eigenfunctions, it is easy to
calculate probabilities by reading off the coefficients and then taking their
modulus squared. More commonly, however, the wave function is specified as a
single function that has not been split into a sum of energy eigenfunctions. In such
a case, we use the overlap rule directly. The following worked example shows
how this is done. You will need to remember that the energy eigenfunctions in a
symmetric one-dimensional infinite square well of width $L$ take the form

$$\psi_n(x) = \sqrt{\frac{2}{L}} \cos\left( \frac{n\pi x}{L} \right) \quad \text{for } n = 1, 3, 5, \ldots, \tag{Eqn 3.29}$$

$$\psi_n(x) = \sqrt{\frac{2}{L}} \sin\left( \frac{n\pi x}{L} \right) \quad \text{for } n = 2, 4, 6, \ldots, \tag{Eqn 3.28}$$

inside the well, with $\psi_n(x) = 0$ outside.
Worked Example 4.1 (Essential skill: Using the overlap rule)

A particle of mass $m$ is in a symmetric one-dimensional infinite square well
from $x = -L/2$ to $x = +L/2$. At time $t = 0$, the state of the particle is
given by the normalized wave function

$$\Psi(x,0) = \sqrt{\frac{30}{L^5}} \left( x^2 - \frac{L^2}{4} \right) \quad \text{for } -L/2 \le x \le L/2.$$

If the energy is measured at this time, what is the probability of obtaining
the ground-state energy $E_1 = \pi^2\hbar^2/2mL^2$?

Solution

Using the overlap rule, the probability amplitude for energy $E_1$ is

$$c_1(0) = \int_{-L/2}^{L/2} \psi_1^*(x)\, \Psi(x,0)\, dx = \sqrt{\frac{2}{L}} \sqrt{\frac{30}{L^5}} \int_{-L/2}^{L/2} \cos\left( \frac{\pi x}{L} \right) \left( x^2 - \frac{L^2}{4} \right) dx.$$


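The integral can be evaluated by changing the variable of integration from
$x$ to $y = \pi x/L$. The limits of integration change from $x = \pm L/2$ to
$y = \pm\pi/2$, and $dx = (L/\pi)\, dy$, so we obtain

$$c_1(0) = \frac{\sqrt{60}}{L^3} \left[ \left( \frac{L}{\pi} \right)^3 \int_{-\pi/2}^{\pi/2} y^2 \cos y \, dy - \frac{L^2}{4} \times \frac{L}{\pi} \int_{-\pi/2}^{\pi/2} \cos y \, dy \right].$$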


Then, using a standard integral given inside the back cover of the book,

$$c_1(0) = \frac{\sqrt{60}}{\pi^3} \left( \frac{\pi^2}{2} - 4 \right) - \frac{\sqrt{60}}{4\pi} \times 2 = -\frac{4\sqrt{60}}{\pi^3},$$

so the probability of energy $E_1$ is

$$p_1 = |c_1(0)|^2 = \frac{960}{\pi^6} \simeq 0.9986.$$

Comment: This result shows that any measurement of the energy is very
likely to give the value $E_1$, with just over one case in 1000 giving some
other value. The answer makes good sense because the wave function is
quite similar in shape to the ground-state energy eigenfunction, so there is a
large overlap between them.

Exercise 4.6

(a) Show that the definite integral of any odd function $f(x)$, taken over a range
that is centred on $x = 0$, is equal to zero. (This useful result can save a lot
of unnecessary work.)


(b) Use the result of part (a) to show that the probability of getting the energy
eigenvalue $E_2$ in a state described by the wave function of Worked Example 4.1 is
equal to zero.

Exercise 4.7 What is the probability of getting the energy eigenvalue $E_3$ in a
state described by the wave function of Worked Example 4.1?
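The ground-state overlap calculated in Worked Example 4.1 can also be checked numerically. The sketch below is a minimal illustration, not part of the course text: it assumes a well of width $L = 1$ (the probability does not depend on this choice) and uses a hand-written Simpson's rule, `simpson`, to evaluate the overlap integral.

```python
import math

L = 1.0  # assumed well width; the probability is independent of L


def psi_1(x):
    """Ground-state eigenfunction, Equation 3.29 with n = 1."""
    return math.sqrt(2.0 / L) * math.cos(math.pi * x / L)


def wave_function(x):
    """Psi(x, 0) of Worked Example 4.1 (inside the well)."""
    return math.sqrt(30.0 / L**5) * (x**2 - L**2 / 4.0)


def simpson(f, a, b, n=2000):
    """Composite Simpson's rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0


# Both functions are real, so psi_1^* = psi_1 and |c1|^2 = c1^2.
c1 = simpson(lambda x: psi_1(x) * wave_function(x), -L / 2, L / 2)
p1 = c1**2
exact = 960.0 / math.pi**6

print(c1, p1, exact)
```

The numerical value of `p1` should agree closely with $960/\pi^6$, and the negative sign of `c1` has no effect on the probability.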


4.2.2 Extension to other observables 


So far, we have concentrated on energy measurements in a one-dimensional 
infinite well, but the ideas introduced above can be extended to other quantities 
and to other systems. The purpose of this subsection is to sketch how this is done. 
Subsequent sections of the chapter do not rely on this material, so you should read 
it quickly, without lingering on the details; the main point to grasp is that the 
overlap rule is a general principle of quantum mechanics, and is not confined to 
energy measurements. 


Let us consider an observable quantity $A$, which need not be energy. In quantum
mechanics, $A$ is represented by a linear operator $\hat{A}$, and we can construct the
eigenvalue equation

$$\hat{A}\, \phi_i(x) = a_i\, \phi_i(x).$$

We shall assume that the eigenvalues $a_i$ are discrete, and that each corresponds to
a single eigenfunction $\phi_i(x)$. Then the eigenvalues are the possible values of the
observable $A$, and the probability of obtaining the eigenvalue $a_i$ is given by

$$p_i = \left| \int_{-\infty}^{\infty} \phi_i^*(x)\, \Psi(x,t)\, dx \right|^2, \tag{4.19}$$

where $\Psi(x,t)$ is the wave function of the system at the time of measurement, and
$\phi_i(x)$ is the eigenfunction of $\hat{A}$ with eigenvalue $a_i$. This extends the overlap rule
to observables other than energy. The important point to note is that it involves the
eigenfunctions of the observable $A$ that is being measured.


It is also possible to extend the coefficient rule. Suppose that the wave function of
the system is expressed as a linear combination of the eigenfunctions $\phi_i(x)$ of $\hat{A}$:

$$\Psi(x,t) = \sum_i c_i(t)\, \phi_i(x).$$

Then a measurement of $A$, carried out at time $t$, will yield the eigenvalue $a_i$ with
probability

$$p_i = |c_i(t)|^2. \tag{4.20}$$


In general, the coefficients $c_i(t)$ will be complicated functions of time, and the
probabilities $p_i$ will depend on time. However, the important point to note is that
the coefficient rule can be used for observables other than energy, provided that 
the wave function is expressed as a linear combination of the eigenfunctions of the 
observable A that is being measured. 


The above descriptions assume that the eigenvalues of A are discrete. However, it 
must be admitted that many observables do not have purely discrete eigenvalues. 
For example, position and momentum are not quantized. If we measure the 
position of a particle, we might obtain any value in a continuous range. A 
different type of description is needed for observables with continuous 

values. Precise measurements of continuous variables make no sense from an 
experimental point of view. In practice, one cannot measure the exact value of a 
continuous variable, but only its value to within some finite resolution. When a 
Geiger counter clicks, for example, it tells us only that the particle is somewhere 
inside the counter; it does not reveal the exact point occupied by the particle. 


It is actually rather pointless to talk about the probability of detecting a particle at 
a single point, P. An infinite number of points lie arbitrarily close to P, so the 
chances of finding the particle exactly at P are completely negligible. It is much 
more sensible to talk about the probability of finding the particle in a small region 
around P. This is what Born’s rule does. In one dimension this rule states that the
probability of finding the particle in a small interval of length $\delta x$, centred on the
point $x$, is $|\Psi(x,t)|^2\, \delta x$. In other words, the probability density for position (the
probability per unit length) is $|\Psi(x,t)|^2$. At first sight, this looks very different
from the overlap rule, but this is not really so.


To express Born’s rule in terms of a (modified) overlap rule, we introduce the
‘top-hat’ function $T_{x_0}(x)$ shown in Figure 4.7. This function has a constant value
$1/\sqrt{w}$ throughout a small range of width $w$, centred on the point $x_0$, and is equal
to zero outside this range. The top-hat function is normalized because

$$\int_{-\infty}^{\infty} |T_{x_0}(x)|^2\, dx = \int_{x_0 - w/2}^{x_0 + w/2} \left( \frac{1}{\sqrt{w}} \right)^2 dx = \frac{1}{w} \times w = 1.$$

Figure 4.7 A top-hat function of width $w$, centred on the point $x_0$. The (informal)
name of this function comes from its shape!

If we take the overlap integral of a very narrow top-hat function with the wave
function $\Psi(x,t)$, we can make the approximation

$$\int_{-\infty}^{\infty} T_{x_0}^*(x)\, \Psi(x,t)\, dx \simeq \frac{1}{\sqrt{w}}\, \Psi(x_0,t) \times w = \Psi(x_0,t)\, \sqrt{w},$$

and this approximation can be made as accurate as we like by taking a top-hat
function that is narrow enough. Setting $w = \delta x$, and taking the square of the
modulus of both sides, we therefore have

$$\left| \int_{-\infty}^{\infty} T_{x_0}^*(x)\, \Psi(x,t)\, dx \right|^2 = |\Psi(x_0,t)|^2\, \delta x. \tag{4.21}$$


According to Born’s rule, this is the probability of finding the particle within a
small interval of width $\delta x$, centred on the point $x_0$. Born’s rule can therefore be
regarded as a modified version of the overlap rule, applied to a high-resolution 
measurement of position. (The only modification is that the left-hand side of 
Equation 4.21 involves a narrow top-hat function, rather than an eigenfunction 
arising explicitly from an eigenvalue equation, but this can be regarded as a 
technicality.) 
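The way the top-hat overlap approaches Born's rule as the top-hat narrows can be seen in a short numerical experiment. The sketch below is an illustration under stated assumptions: it uses the ground-state eigenfunction of an infinite square well of width $L = 1$ as the wave function at a fixed time (the time-dependent phase factor does not affect the modulus), an arbitrarily chosen point `x0`, and a simple midpoint rule for the integral.

```python
import math

L = 1.0  # assumed well width


def psi(x):
    """Ground-state eigenfunction of the infinite well (real-valued)."""
    if abs(x) > L / 2:
        return 0.0
    return math.sqrt(2.0 / L) * math.cos(math.pi * x / L)


def top_hat_overlap_probability(x0, w, n=1000):
    """|integral of T(x) Psi(x) dx|^2 for a top-hat of width w centred on x0."""
    h = w / n
    # Midpoint rule over the support of the top-hat, where T = 1/sqrt(w).
    samples = sum(psi(x0 - w / 2 + (k + 0.5) * h) for k in range(n))
    integral = samples * h / math.sqrt(w)
    return integral**2


x0 = 0.1  # hypothetical point at which the particle is sought
ratios = {}
for w in (0.1, 0.01, 0.001):
    overlap = top_hat_overlap_probability(x0, w)
    born = psi(x0) ** 2 * w  # Born's rule: |Psi(x0)|^2 times the width
    ratios[w] = overlap / born
    print(w, overlap, born, ratios[w])
```

The ratio of the two probabilities tends to 1 as the width is reduced, in line with the approximation made in the text.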


Equation 4.21 is not useful for calculations; if you want to find the probability 

of detecting a particle in a small region, you should use Born’s rule directly. 
However, the focus of this subsection has been mainly conceptual. Its purpose is 
accomplished if you remember one thing: the overlap rule is a universal principle, 
which applies to all measurements in wave mechanics, not just those involving 
energies. 


4.3 Expectation values in quantum mechanics 


The probability distribution for a given observable can be represented graphically 
— either as a bar chart of probabilities for a discrete variable, or as a graph of 
probability density for a continuous variable (Figure 4.8). 


[Figure 4.8: panel (a) plots probability against energy $E$, with the values $E_1$, $E_2$ and $E_3$ marked; panel (b) plots probability density against $x$.]


Figure 4.8 (a) A bar chart showing probabilities of discrete energy values. 
(b) A graph showing the probability density of position. 


Probability distributions like these summarize the extent to which quantum 
mechanics can predict the likely results of measurements. However, for many 
purposes, it is convenient to give a less detailed description, characterizing 
the probability distribution by two measures — its expectation value (which 
represents the mean value of the distribution) and its uncertainty (which 
represents the spread in values around the mean). This section will discuss 
expectation values. 




4.3.1 Mean values and expectation values 


Suppose that an observable quantity $A$ has a discrete set of possible values,
$a_1, a_2, \ldots$. Then we can prepare $N$ identical systems, all in the same state, and
measure the value of $A$ in each system. In general, we will get a spread of results.
Let’s suppose that result $a_i$ is obtained in $n_i$ of the measurements. Then we can
form the quantity

$$\bar{A} = \frac{n_1 a_1 + n_2 a_2 + \cdots}{N} = \sum_i \left( \frac{n_i}{N} \right) a_i. \tag{4.22}$$

This is the mean value of $A$ taken over the set of $N$ measurements. The quantity
$n_i/N$ is the relative frequency of result $a_i$; since $\sum_i n_i = N$, the sum of the
relative frequencies is equal to 1.


The expectation value of an observable $A$ is the quantum-mechanical prediction
for the mean value. It is denoted by the symbol $\langle A \rangle$ and obtained by replacing the
relative frequencies in Equation 4.22 by quantum-mechanical probabilities. Thus

$$\langle A \rangle = \sum_i p_i\, a_i, \tag{4.23}$$

where $p_i$ is the probability of result $a_i$. It is in the nature of chance that the mean
value $\bar{A}$, taken over a finite number of measurements, may deviate slightly
from the expectation value $\langle A \rangle$. However, in the long run, as the number of
measurements tends to infinity, the mean value is expected to converge to the
expectation value. So a quantum-mechanical prediction of the expectation value
can be compared with experiment.
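The convergence of the mean value to the expectation value can be imitated with a simple simulation. The sketch below is illustrative only: the possible values `values` and their probabilities `probs` are hypothetical, and a pseudo-random number generator with a fixed seed stands in for repeated measurements on identically prepared systems.

```python
import random


def expectation_value(values, probs):
    """Equation 4.23: <A> = sum_i p_i a_i."""
    return sum(p * a for a, p in zip(values, probs))


def sample_mean(values, probs, n_measurements, rng):
    """Equation 4.22: the mean of n simulated measurement results."""
    results = rng.choices(values, weights=probs, k=n_measurements)
    return sum(results) / n_measurements


values = [1.0, 2.0]     # hypothetical possible values a_1, a_2
probs = [0.36, 0.64]    # hypothetical probabilities p_1, p_2
rng = random.Random(0)  # fixed seed so that the run is repeatable

expected = expectation_value(values, probs)
for n in (10, 1000, 100000):
    print(n, sample_mean(values, probs, n, rng), expected)
```

For small numbers of measurements the mean scatters noticeably around the expectation value; the scatter shrinks as the number of measurements grows.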


The expectation value of an observable can be readily calculated if we know its
probability distribution. For example, suppose that a system is in the state
described by the normalized wave function

$$\Psi(x,t) = \frac{1}{\sqrt{2}} \left( \psi_1(x)\, e^{-iE_1 t/\hbar} + \psi_2(x)\, e^{-iE_2 t/\hbar} \right),$$

where $\psi_1(x)$ and $\psi_2(x)$ are energy eigenfunctions with eigenvalues $E_1$ and $E_2$. In
this case, the coefficient rule shows that the probabilities of obtaining the values
$E_1$ and $E_2$ are $p_1 = 1/2$ and $p_2 = 1/2$. All other values have zero probability, so
the expectation value of the energy in this state is

$$\langle E \rangle = p_1 E_1 + p_2 E_2 = \frac{E_1 + E_2}{2},$$

which is midway between the two allowed energy values. Note that this
expectation value is not one of the allowed energy values. A single energy
measurement gives either $E_1$ or $E_2$, but never $(E_1 + E_2)/2$. This is not surprising
because the mean value of a set of numbers need not be equal to any of the
numbers in the set. One is reminded of the fact that an average family in the UK
contains 1.7 children, although no real family has such a composition.
Exercise 4.8 Find the expectation value of energy in the state described in
Exercise 4.5, expressing your answer in terms of the energy eigenvalues $E_1$ and
$E_2$.

We can also define the expectation values of $A^2$, $A^3$, or any other power of $A$ by
using appropriate powers of the eigenvalues. For example,

$$\langle A^2 \rangle = \sum_i p_i\, a_i^2, \tag{4.24}$$

where the sum runs over all the possible values $a_i$.

Expectation values can also be defined for observables with a continuous set of
possible values, such as position. The probability of obtaining a value of position
in the small interval between $x - \delta x/2$ and $x + \delta x/2$ is given by

$$\text{probability} = |\Psi(x,t)|^2\, \delta x,$$

where $|\Psi(x,t)|^2$ is the probability density for position at the time of measurement.
The expectation value of position is then defined by the integral

$$\langle x \rangle = \int_{-\infty}^{\infty} |\Psi(x,t)|^2\, x\, dx, \tag{4.25}$$

and the expectation value of $x^2$ is given by

$$\langle x^2 \rangle = \int_{-\infty}^{\infty} |\Psi(x,t)|^2\, x^2\, dx. \tag{4.26}$$

Comparing with the discrete case (Equations 4.23 and 4.24), we see that a sum
over allowed values weighted by probabilities is replaced by an integral over
allowed values weighted by a probability density function.


4.3.2 The sandwich integral rule


Our expression for the expectation value of an observable (Equation 4.23 in the
discrete case) can be evaluated if we know all the possible values and their
probabilities. In principle, the probabilities can be found using the methods of
Section 4.2, but this could involve a great deal of work, because the number of
possible values could be vast, or even infinite. Fortunately, there is another way of
calculating expectation values, using the following rule.


The sandwich integral rule for expectation values

In a state described by the wave function $\Psi(x,t)$, the expectation value of an
observable $A$ at time $t$ is given by

$$\langle A \rangle = \int_{-\infty}^{\infty} \Psi^*(x,t)\, \hat{A}\, \Psi(x,t)\, dx, \tag{4.27}$$

where $\hat{A}$ is the quantum-mechanical operator corresponding to $A$. (The operator
$\hat{A}$ acts on $\Psi(x,t)$. The resulting function is then multiplied by $\Psi^*(x,t)$ and this
product is integrated over all $x$.) The operator $\hat{A}$ is ‘sandwiched’ between the two
functions $\Psi^*(x,t)$ and $\Psi(x,t)$; for this reason, we informally call the integral on
the right-hand side a sandwich integral.


It is easy to see that the sandwich integral rule works for the expectation value
of position. The position operator $\hat{x}$ simply tells us to multiply by $x$, so
Equation 4.27 is just Equation 4.25 in disguise.

Equation 4.27 can be justified in other cases as well. As an illustration, we will
calculate the sandwich integral

$$\int_{-\infty}^{\infty} \Psi^*(x,t)\, \hat{H}\, \Psi(x,t)\, dx$$

for a system with discrete, non-degenerate energy eigenvalues.


Let’s assume that the wave function $\Psi(x,t)$ is a linear combination of
stationary-state wave functions, which means that

$$\Psi(x,t) = \sum_i a_i\, \psi_i(x)\, e^{-iE_i t/\hbar} = \sum_i c_i(t)\, \psi_i(x), \tag{4.28}$$

where $c_i(t) = a_i\, e^{-iE_i t/\hbar}$, and $\psi_i(x)$ is an energy eigenfunction with
eigenvalue $E_i$. Then applying the Hamiltonian operator to this wave function
gives

$$\hat{H}\, \Psi(x,t) = \sum_i c_i(t)\, \hat{H}\, \psi_i(x) = \sum_i c_i(t)\, E_i\, \psi_i(x), \tag{4.29}$$

where we have used the fact that $\hat{H}$ is a linear operator to move it inside the
summation sign, and remembered that $\psi_i(x)$ is an eigenfunction of $\hat{H}$ with
eigenvalue $E_i$.


Inserting Equations 4.28 and 4.29 into the sandwich integral gives

$$\int_{-\infty}^{\infty} \Psi^*(x,t)\, \hat{H}\, \Psi(x,t)\, dx = \int_{-\infty}^{\infty} \left( \sum_j c_j^*(t)\, \psi_j^*(x) \right) \left( \sum_i c_i(t)\, E_i\, \psi_i(x) \right) dx.$$

Note carefully that we have used different indices, $i$ and $j$, in the two sums in the
integrand. The reason for this was explained on page 106, in the context of
normalizing a wave function; if we did not take this precaution, ‘cross-product’
terms would be incorrectly omitted.


The integral on the right-hand side may look complicated, but it can be simplified
by interchanging the summation and integral signs, and taking terms that do not
depend on $x$ outside the integral. This gives

$$\int_{-\infty}^{\infty} \Psi^*(x,t)\, \hat{H}\, \Psi(x,t)\, dx = \sum_{ij} c_j^*(t)\, c_i(t)\, E_i \int_{-\infty}^{\infty} \psi_j^*(x)\, \psi_i(x)\, dx.$$


We can use the orthonormality of the energy eigenfunctions to replace the
remaining integral by a Kronecker delta symbol $\delta_{ij}$. This removes all terms with
$i \neq j$, so the double sum over $i$ and $j$ becomes a single sum over $i$:

$$\int_{-\infty}^{\infty} \Psi^*(x,t)\, \hat{H}\, \Psi(x,t)\, dx = \sum_{ij} c_j^*(t)\, c_i(t)\, E_i\, \delta_{ij} = \sum_i c_i^*(t)\, c_i(t)\, E_i.$$


Finally, the coefficient rule allows us to interpret $c_i^*(t)\, c_i(t) = |c_i(t)|^2$ as the
probability $p_i$ of obtaining the eigenvalue $E_i$, so we conclude that

$$\int_{-\infty}^{\infty} \Psi^*(x,t)\, \hat{H}\, \Psi(x,t)\, dx = \sum_i p_i\, E_i = \langle E \rangle,$$

as required.

This justification of the sandwich integral rule was based on a special case, but the
sandwich integral rule is valid for any observable in any state in wave mechanics.
For example, the expectation value of momentum is given by

$$\langle p_x \rangle = \int_{-\infty}^{\infty} \Psi^*(x,t) \left( -i\hbar \frac{\partial}{\partial x} \right) \Psi(x,t)\, dx, \tag{4.30}$$

and the expectation value of $p_x^2$ is

$$\langle p_x^2 \rangle = \int_{-\infty}^{\infty} \Psi^*(x,t) \left( -i\hbar \frac{\partial}{\partial x} \right)^2 \Psi(x,t)\, dx = -\hbar^2 \int_{-\infty}^{\infty} \Psi^*(x,t)\, \frac{\partial^2}{\partial x^2} \Psi(x,t)\, dx. \tag{4.31}$$

Because observables are represented by linear operators, it is safe to assume that

$$\langle \alpha A + \beta B \rangle = \alpha \langle A \rangle + \beta \langle B \rangle \tag{4.32}$$

for any constants $\alpha$ and $\beta$. However, care is needed when dealing with powers of
observables. In general,

$$\langle A^2 \rangle \neq \langle A \rangle^2, \tag{4.33}$$

as you will see in the following worked example.


Worked Example 4.2 (Using the sandwich integral rule to calculate expectation values)


Find the expectation values of $x$ and $x^2$ in the ground state of a symmetric
one-dimensional infinite square well, with walls at $x = -L/2$ and
$x = +L/2$.

Solution

The ground-state wave function vanishes outside the well. Inside the well
(for $-L/2 \le x \le L/2$), it is

$$\Psi_1(x,t) = \sqrt{\frac{2}{L}} \cos\left( \frac{\pi x}{L} \right) e^{-iE_1 t/\hbar}.$$

Using the sandwich integral rule (Equation 4.27), the expectation value of $x$
is given by

$$\langle x \rangle = \frac{2}{L} \int_{-L/2}^{L/2} \left[ \cos\left( \frac{\pi x}{L} \right) e^{+iE_1 t/\hbar} \right] x \left[ \cos\left( \frac{\pi x}{L} \right) e^{-iE_1 t/\hbar} \right] dx.$$

The time-dependent phase factors combine to give 1, and we are left with

$$\langle x \rangle = \frac{2}{L} \int_{-L/2}^{L/2} x \cos^2\left( \frac{\pi x}{L} \right) dx.$$

This integral vanishes because the integrand is an odd function of $x$ and the
range of integration is centred on the origin. Hence

$$\langle x \rangle = 0.$$

Proceeding in a similar way, the expectation value of $x^2$ is

$$\langle x^2 \rangle = \frac{2}{L} \int_{-L/2}^{L/2} x^2 \cos^2\left( \frac{\pi x}{L} \right) dx.$$

Changing the variable of integration to $y = \pi x/L$, we obtain

$$\langle x^2 \rangle = \frac{2}{L} \left( \frac{L}{\pi} \right)^3 \int_{-\pi/2}^{\pi/2} y^2 \cos^2 y \, dy.$$

Finally, using a standard integral given inside the back cover of the book, we
obtain

$$\langle x^2 \rangle = \frac{2L^2}{\pi^3} \left( \frac{\pi^3}{24} - \frac{\pi}{4} \right) = 0.0327\, L^2.$$

Clearly, $\langle x^2 \rangle \neq \langle x \rangle^2$.
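The two expectation values found in Worked Example 4.2 can be checked numerically. The sketch below is a minimal illustration: it assumes a well of width $L = 1$ and uses a hand-written Simpson's rule, `simpson`, to integrate $x$ and $x^2$ against the ground-state probability density $(2/L)\cos^2(\pi x/L)$.

```python
import math

L = 1.0  # assumed well width, so <x^2> is obtained in units of L^2


def density(x):
    """|Psi_1|^2 = (2/L) cos^2(pi x / L); the phase factors cancel."""
    return (2.0 / L) * math.cos(math.pi * x / L) ** 2


def simpson(f, a, b, n=2000):
    """Composite Simpson's rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0


mean_x = simpson(lambda x: x * density(x), -L / 2, L / 2)
mean_x2 = simpson(lambda x: x**2 * density(x), -L / 2, L / 2)

# Closed form (2 L^2 / pi^3)(pi^3/24 - pi/4), written more compactly.
closed_form = L**2 * (1.0 / 12.0 - 1.0 / (2.0 * math.pi**2))

print(mean_x, mean_x2, closed_form)
```

The first integral vanishes to within rounding error, as expected for an odd integrand, while the second agrees with the closed-form value.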


Exercise 4.9 Find the expectation values of $p_x$ and $p_x^2$ in the ground state of a
symmetric one-dimensional infinite square well, with walls at $x = -L/2$ and
$x = +L/2$.

Exercise 4.10 A particle of mass $m$ is in a symmetric one-dimensional infinite
square well of width $L$. Find the expectation value of the energy of this particle in
a state described by the normalized wave function

$$\Psi(x,0) = \sqrt{\frac{30}{L^5}} \left( x^2 - \frac{L^2}{4} \right) \quad \text{for } -L/2 \le x \le L/2,$$

with $\Psi(x,0) = 0$ outside the well.


4.4 Uncertainties in quantum mechanics 


The two sets of data values in Figure 4.9 below are very different, even though 
they have the same mean value; those in Figure 4.9b are much more spread out 
than those in Figure 4.9a. For many purposes, it is useful to have a measure of the 
amount by which a set of data values spreads out on either side of the mean value. 
The standard deviation $\sigma(A)$ gives us such a measure.

Figure 4.9 A set of data values with (a) a small spread and (b) a large spread.
[Both panels plot probability against $A$.]

For each value $a_i$, we can form its deviation from the mean value, $(a_i - \bar{A})$.
We might try to measure the spread of values by taking the average value of
these deviations. Unfortunately, this is useless because the average value of the
deviations is equal to zero. (The negative deviations and the positive deviations
exactly cancel out.) The standard deviation, $\sigma(A)$, is defined by taking the
squares of the deviations before taking the average, and then taking the square
root of the result. This produces a quantity with the same units as $A$. The squares
of the deviations add cumulatively, without any cancellation, so $\sigma(A)$ gives a
sensible measure of the spread of values of $A$.


Just as an expectation value is a theoretical prediction for a mean value, so an
uncertainty is a theoretical prediction for a standard deviation. For a system in a
given state $\Psi$, the uncertainty $\Delta A$ of an observable $A$ is defined by

$$\Delta A = \left\langle \left( A - \langle A \rangle \right)^2 \right\rangle^{1/2}, \tag{4.34}$$

where the expectation values are calculated in the state $\Psi$. If many measurements
of $A$ are taken on identical systems all in the state $\Psi$, then the standard deviation
$\sigma(A)$ of the results is expected to converge to the uncertainty $\Delta A$ as the number
of measurements tends to infinity. So a quantum-mechanical prediction for the
uncertainty can be compared with experiment.


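The uncertainty can also be expressed in a more convenient form. Squaring both
sides and expanding the round brackets in Equation 4.34 gives

$$(\Delta A)^2 = \left\langle A^2 - A\langle A \rangle - \langle A \rangle A + \langle A \rangle^2 \right\rangle.$$

Remembering that $\langle A \rangle$ is a constant, and using Equation 4.32, we obtain

$$(\Delta A)^2 = \langle A^2 \rangle - 2\langle A \rangle \langle A \rangle + \langle A \rangle^2 = \langle A^2 \rangle - \langle A \rangle^2,$$

and hence

$$\Delta A = \left( \langle A^2 \rangle - \langle A \rangle^2 \right)^{1/2}. \tag{4.35}$$

This formula is an efficient way of calculating uncertainties, with the expectation
values of $A$ and $A^2$ found by using the sandwich integral rule.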
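For a discrete distribution, the equivalence of the two expressions for the uncertainty is easy to confirm numerically. In the sketch below, the possible values and probabilities are hypothetical numbers chosen for illustration, and the function names are not taken from the course text.

```python
import math


def expectation(values, probs, power=1):
    """<A^power> = sum_i p_i a_i^power for a discrete distribution."""
    return sum(p * a**power for a, p in zip(values, probs))


def uncertainty_from_deviations(values, probs):
    """Equation 4.34: square root of the expected squared deviation."""
    mean = expectation(values, probs)
    return math.sqrt(sum(p * (a - mean) ** 2 for a, p in zip(values, probs)))


def uncertainty_from_moments(values, probs):
    """Equation 4.35: sqrt(<A^2> - <A>^2)."""
    mean = expectation(values, probs)
    return math.sqrt(expectation(values, probs, 2) - mean**2)


values = [1.0, 4.0]   # hypothetical possible values a_1, a_2
probs = [0.36, 0.64]  # hypothetical probabilities p_1, p_2

delta_a_deviations = uncertainty_from_deviations(values, probs)
delta_a_moments = uncertainty_from_moments(values, probs)
print(delta_a_deviations, delta_a_moments)
```

Both routes give the same uncertainty; the second needs only the two expectation values $\langle A \rangle$ and $\langle A^2 \rangle$.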


Worked Example 4.3

Calculate the uncertainties of position and momentum in the ground state of
a one-dimensional infinite square well of width $L$, centred on the origin.

Solution

It simplifies the working to consider the square of the uncertainties until the
last step in the calculation. For position,

$$(\Delta x)^2 = \langle x^2 \rangle - \langle x \rangle^2.$$

From Worked Example 4.2, we have $\langle x \rangle = 0$ and $\langle x^2 \rangle = 0.0327\, L^2$, so

$$\Delta x = \sqrt{0.0327}\, L = 0.18L.$$

For momentum,

$$(\Delta p_x)^2 = \langle p_x^2 \rangle - \langle p_x \rangle^2.$$

From the solution to Exercise 4.9, we have $\langle p_x \rangle = 0$ and $\langle p_x^2 \rangle = \pi^2\hbar^2/L^2$,
so

$$\Delta p_x = \sqrt{\frac{\pi^2\hbar^2}{L^2}} = \frac{\pi\hbar}{L}.$$
Exercise 4.11 Show that any stationary state has zero uncertainty in energy.

Exercise 4.12 A system is in a linear combination of stationary states
described by the wave function

$$\Psi(x,t) = \frac{1}{\sqrt{2}} \left( \Psi_1(x,t) + \Psi_2(x,t) \right),$$

where $\Psi_1(x,t)$ and $\Psi_2(x,t)$ are stationary states with energies $E_1$ and $E_2$. What
is the uncertainty in energy in this state?


4.5 The Heisenberg uncertainty principle 


The basic principles of quantum mechanics were worked out in a remarkably 
short period, from 1925 to 1927. Towards the end of this revolution, Heisenberg 
(Figure 4.10) proposed his celebrated uncertainty principle. It is possible to prove 
the uncertainty principle from basic quantum-mechanical rules, and this will be 
done later in the course. Here, we shall concentrate on its meaning, interpretation 
and applications, and on how it is used to make order-of-magnitude estimates. 


The Heisenberg uncertainty principle can be stated as follows: 


The Heisenberg uncertainty principle 


In any system, in any state, the uncertainties of position and momentum
components obey the inequality

$$\Delta x\, \Delta p_x \ge \frac{\hbar}{2}. \tag{4.36}$$

This inequality involves position and momentum components along the
same axis. Similar results apply to the uncertainty products $\Delta y\, \Delta p_y$ and
$\Delta z\, \Delta p_z$, but a product such as $\Delta x\, \Delta p_y$, which involves position and
momentum components along different axes, does not obey an inequality
and can, in principle, be equal to zero.


Figure 4.10 Werner 
Heisenberg (1901-1976) 
proposed the uncertainty 
principle in 1927. 


You have seen how the uncertainties $\Delta x$ and $\Delta p_x$ are defined and calculated.
The important thing to remember is that these uncertainties refer to a particular
quantum-mechanical state. Even though the state is known precisely, we still
obtain a spread of values when we measure position, and another spread of
values when we measure momentum. These spreads in values correspond to
uncertainties that must obey the Heisenberg uncertainty principle. For example,
using the results of Worked Example 4.3, we see that the uncertainties in the
ground state of a one-dimensional infinite square well satisfy

$$\Delta x\, \Delta p_x = 0.18L \times \frac{\pi\hbar}{L} = 0.57\hbar > \frac{\hbar}{2},$$

which is consistent with the uncertainty principle.


The uncertainty principle tells us that it is impossible to find a state in which 

a particle has definite values of both position and momentum. It therefore 
demolishes the classical picture of a particle following a well-defined trajectory. 
This must have been very satisfying to Heisenberg, who was driven by the 
conviction that the orbits of Bohr’s atomic model had no part to play in quantum 
mechanics. As early as 1925, Heisenberg wrote: 


‘my entire meagre efforts go towards killing off and suitably replacing the 
concept of orbital paths that one cannot observe.’ 


With the uncertainty principle, he effectively barred any attempt to revive 
classical orbits. Instead, wave mechanics describes the state of a system by a 
wave function, which implies a spread of position and momentum values. The 
uncertainty principle requires that a narrow spread in one of these quantities is 
offset by a wide spread in the other. 


It is important to remember that the uncertainty principle is an inequality, not an
equation. If $\Delta x$ is small, it follows that $\Delta p_x$ is large, at least as large as
$\hbar/(2\, \Delta x)$. However, if $\Delta x$ is large, the uncertainty principle does not guarantee
that $\Delta p_x$ is small. It only tells us that $\Delta p_x$ is greater than or equal to $\hbar/(2\, \Delta x)$,
which is a weak constraint when $\Delta x$ is large. In fact, it is perfectly possible for
both $\Delta x$ and $\Delta p_x$ to be large. This is consistent with the uncertainty principle and
turns out to be quite common in states with high quantum numbers. Nevertheless,
states with low quantum numbers usually have a product of the position and
momentum uncertainties that is close to the minimum value of $\hbar/2$. For this
reason, physicists sometimes suppose that $\Delta x\, \Delta p_x \simeq \hbar/2$ for the lowest levels in
a system. Unlike the uncertainty principle, this statement is not rigorous, but it
can be used as a rough rule of thumb.
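The growth of the uncertainty product with quantum number can be illustrated for the infinite square well. The sketch below is not taken from the course text and rests on two stated assumptions: units with $\hbar = 1$ and $L = 1$, and the standard results $\langle p_x \rangle = 0$ and $\langle p_x^2 \rangle = (n\pi\hbar/L)^2$ for the $n$th energy eigenfunction (for $n = 1$ these are the values quoted from Exercise 4.9). The position uncertainty is found by numerical integration with a hand-written Simpson's rule.

```python
import math

L = 1.0     # assumed well width
HBAR = 1.0  # assumption: units chosen so that hbar = 1


def eigenfunction(n, x):
    """Equations 3.29 and 3.28: cosine for odd n, sine for even n."""
    trig = math.cos if n % 2 else math.sin
    return math.sqrt(2.0 / L) * trig(n * math.pi * x / L)


def simpson(f, a, b, n=2000):
    """Composite Simpson's rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0


def uncertainty_product(n):
    """Delta x * Delta p_x in the nth stationary state of the well."""
    mean_x = simpson(lambda x: x * eigenfunction(n, x) ** 2, -L / 2, L / 2)
    mean_x2 = simpson(lambda x: x**2 * eigenfunction(n, x) ** 2, -L / 2, L / 2)
    delta_x = math.sqrt(mean_x2 - mean_x**2)
    # Assumed: <p_x> = 0 and <p_x^2> = (n pi hbar / L)^2 in this state.
    delta_p = n * math.pi * HBAR / L
    return delta_x * delta_p


for n in (1, 2, 5, 10):
    print(n, round(uncertainty_product(n), 3), "in units of hbar")
```

The ground state gives a product a little above $\hbar/2$, while states with higher quantum numbers give much larger products; every value remains consistent with the inequality.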


Interpretation 


The uncertainty principle is sometimes interpreted in terms of the disturbance 
produced by a measurement. For example, we know that a measurement of 
position disturbs the system, and it is tempting to suppose that this ‘explains’ 
why position and momentum cannot both have definite values at the same 
time. However, such interpretations seem to miss the main point. It is true that 
measurements create disturbances, and that the state of a system is generally 
altered by a measurement. But this is not the subject of the uncertainty principle. 
The uncertainty principle tells us nothing about the disturbance caused by a 
measurement: instead, it tells us about the indeterminacy that is inherent in a 
given state before the measurement is taken. One way of demonstrating this 
indeterminacy is to prepare many identical systems in identical states. If we 
measure position in some of these systems, and momentum in others, we will 
get spreads of values of position and momentum that are consistent with the 
uncertainty principle. However, this reflects the nature of the state before any 
measurements are taken, not the disturbance that the measurement creates in 
individual systems. 




4.5.1 The uncertainty principle in action 


A striking illustration of the uncertainty principle is provided by a gas of atoms 
that has been cooled down to a very low temperature. In 1995, Eric Cornell and 
Carl Wieman cooled a sample of about 2000 rubidium-87 atoms to tens of 
nanokelvins. In order to keep the sample cool, it was trapped in a small volume of 
empty space by magnetic fields and beams of light; this kept it away from the 
relatively warm walls of the container. Figure 4.11 shows the density profile 

of the cloud as it was cooled below 400 nK. The peak that emerges below 

this temperature reveals a remarkable phenomenon, known as Bose-Einstein 
condensation, in which a large proportion of the atoms suddenly occupy a single 
quantum state, the ground state of the well in which they are trapped. 


[Figure 4.11 panels, labelled by temperature: 50 nK, 200 nK and 400 nK.]


Figure 4.11 Density distributions of $^{87}$Rb (rubidium-87) atoms below 400 nK
reveal Bose-Einstein condensation. Cornell and Wieman received the 2001 Nobel
prize for physics for this work.


By allowing the gas cloud to expand freely in a horizontal direction and taking 
snapshots of its shape, a measure could be gained of its momentum distribution. 
The spread of momenta in the peak was found to be close to the minimum allowed 
by the uncertainty principle. On closer inspection, the peak was observed to be 
anisotropic, with a greater spread of momentum in one direction than another. 
This was explained by the fact that the trapping region was itself anisotropic. The 
direction with the widest spread of momentum was that with the narrowest spatial 
confinement, just as one would expect from the uncertainty principle. 


Another example of the uncertainty principle in action is provided by liquid 
helium. At atmospheric pressure, helium remains liquid no matter how much it is 
cooled, and this is largely due to the uncertainty principle. Each helium atom can 
be thought of as being trapped in a small cage formed by its nearest neighbours. It 
therefore has a small uncertainty in position, which must be accompanied by a 
large uncertainty in momentum. It follows that a typical helium atom in the liquid
has a significant momentum magnitude $p$, and a significant kinetic energy $p^2/2m$,
even at absolute zero. Because helium atoms have low mass $m$ and bind together
very weakly, this kinetic energy is enough to disrupt the formation of a solid. Any
clumps of solid helium that do manage to assemble are rapidly shaken to pieces so
the helium remains liquid.

Figure 4.12 A weakly-bound eigenfunction near the top of a potential energy well can extend far beyond the attractive region.


4.5.2 Making estimates with the uncertainty principle 


If we know how a particle is confined, we can generally make a rough estimate of 
its uncertainty in position. For example, we might guess that a particle trapped in
a square well of width $L$ has $\Delta x \simeq L/2$. Actually, this is not a very good
estimate. The uncertainty of any quantity is defined by Equation 4.35, and you
have seen that this gives $\Delta x = 0.18L$ in the ground state of an infinite square
well. This is significantly less than $L/2$ because the ground-state eigenfunction
tails off towards the edges of the well. In a deep finite square well, where the
penetration into the classically forbidden region is small, the uncertainty of
position in the ground state is still of order $0.18L$. Using this value as a guide, and
recognizing that $\langle p_x \rangle = 0$, we have

$$\langle p_x^2 \rangle = (\Delta p_x)^2 \geq \left( \frac{\hbar}{2\,\Delta x} \right)^2 \simeq \frac{8\hbar^2}{L^2}.$$


In a three-dimensional well with smallest linear dimension L, the average kinetic 
energy of a particle of mass m obeys 

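$$\langle E_{\text{kin}} \rangle = \frac{\langle p^2 \rangle}{2m} \geq \frac{3}{2m} \left( \frac{\hbar}{2\,\Delta x} \right)^2 \simeq \frac{12\hbar^2}{mL^2}.$$

The lower bound of this inequality is comparable with the ground-state energy
$3\pi^2\hbar^2/2mL^2$ of a particle in a three-dimensional cubic box, obtained by solving
Schrödinger’s equation.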
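The numbers quoted above can be checked with a short calculation. The following Python sketch is an illustration only: it assumes the normalized ground-state eigenfunction $\psi(x) = \sqrt{2/L}\,\sin(\pi x/L)$ of an infinite square well occupying $0 \leq x \leq L$, evaluates $\Delta x$ by a simple midpoint sum, and then forms the uncertainty-principle bounds in units where $\hbar = m = L = 1$. The three-dimensional bound assumes the same spread $\Delta x$ in each of the three directions. The function name and step count are arbitrary choices.

```python
import math

def ground_state_spread(L=1.0, n_steps=20000):
    """Delta x for psi(x) = sqrt(2/L) sin(pi x / L) on 0 <= x <= L (midpoint rule)."""
    dx = L / n_steps
    mean_x = 0.0
    mean_x2 = 0.0
    for k in range(n_steps):
        x = (k + 0.5) * dx
        prob = (2.0 / L) * math.sin(math.pi * x / L) ** 2 * dx  # |psi|^2 dx
        mean_x += x * prob
        mean_x2 += x * x * prob
    return math.sqrt(mean_x2 - mean_x ** 2)

delta_x = ground_state_spread()           # in units of L
px2_bound = 1.0 / (2.0 * delta_x) ** 2    # bound on <p_x^2>, units of hbar^2 / L^2
ekin_bound = 3.0 * px2_bound / 2.0        # 3D kinetic-energy bound, units of hbar^2 / (m L^2)
ekin_box = 3.0 * math.pi ** 2 / 2.0       # exact cubic-box ground-state energy, same units

print(f"Delta x / L             = {delta_x:.4f}")
print(f"bound on <p_x^2>        = {px2_bound:.2f} hbar^2/L^2")
print(f"bound on <E_kin> (3D)   = {ekin_bound:.2f} hbar^2/(m L^2)")
print(f"cubic-box ground state  = {ekin_box:.2f} hbar^2/(m L^2)")
```

The bound lies somewhat below the exact box energy, as it must, but is of the same order of magnitude.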


Exercise 4.13 Estimate the minimum value of the average kinetic energy of 
an atom in liquid helium at a temperature that is low enough for any thermal 
energy to be neglected. The typical spacing between two atoms in liquid helium is 
$3.5 \times 10^{-10}$ m, and the mass of a helium atom is $6.7 \times 10^{-27}$ kg.


Our estimate $\Delta x \simeq 0.18L$ is reasonable for states near the bottom of a square
well, but becomes less reliable near the top of the well. Figure 4.12 shows an 
eigenfunction very near the top of a square well. This function extends far beyond 
the confines of the well because the particle penetrates deep into the classically 
forbidden region. Such an eigenfunction represents a bound state — a state in 
which the particle is trapped somewhere in the vicinity of the well rather than 
escaping to infinity. However, in spite of being trapped, the particle is quite likely 
to be found outside the well, and has $\Delta x \gg 0.18L$.


States like this really do exist. For example, a lithium-11 nucleus is five times 
larger than normal because two of its neutrons occupy a sort of halo around the 
central core. The wave function of these neutrons extends far beyond the range of 
the nuclear force that keeps them bound to the nucleus. In solid-state physics, two 
electrons in a superconductor can join together to form an entity called a Cooper 
pair. The wave function of a Cooper pair extends over a distance that is at least 
100 times larger than the range of the force that binds the electrons together. 




Because it may not be obvious what value to take for $\Delta x$, an effective way of
using the uncertainty principle is to treat $\Delta x$ as an adjustable parameter and to
vary it in order to find the minimum energy consistent with the uncertainty 
principle. This minimum energy is usually a good approximation to the 
ground-state energy of the system. The following worked example shows how this 
method works. 


Worked Example 4.4

(Essential skill: Using the uncertainty principle to estimate energies and uncertainties in the ground state)

In classical physics, a particle undergoing simple harmonic motion has total
energy

$$E = \frac{p_x^2}{2m} + \tfrac{1}{2} m \omega_0^2 x^2,$$

where $m$ is the mass of the particle and $\omega_0$ is the angular frequency of the
oscillation. Use the uncertainty principle to estimate the energy of the
ground state and the uncertainties in position and momentum in the ground
state.


Solution 


Let the uncertainty in position be $\Delta x = s$; then the uncertainty in
momentum obeys $\Delta p_x \geq \hbar/2s$. The potential energy well is symmetric
about the origin, so $\langle x \rangle = \langle p_x \rangle = 0$ and the uncertainties in position and
momentum provide typical values of position and momentum. It follows
that a typical value of the total energy obeys the inequality

$$E \geq \frac{1}{2m} \left( \frac{\hbar}{2s} \right)^2 + \tfrac{1}{2} m \omega_0^2 s^2 = \frac{\hbar^2}{8ms^2} + \tfrac{1}{2} m \omega_0^2 s^2.$$

The minimum value of the right-hand side is the smallest energy consistent 
with the uncertainty principle; we can take this minimum value as giving a 
good estimate of the ground-state energy. Denoting the function on the 
right-hand side by f(s), we can minimize f(s) by the usual methods of 
calculus. This involves taking the derivative of f(s) with respect to s, and 
setting the result equal to zero. We obtain 


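$$\frac{df}{ds} = \frac{d}{ds} \left( \frac{\hbar^2}{8ms^2} + \tfrac{1}{2} m \omega_0^2 s^2 \right) = -\frac{\hbar^2}{4ms^3} + m \omega_0^2 s = 0,$$

so $s^2 = \hbar/2m\omega_0$, and the ground-state energy is estimated to be

$$E_{\min} = \frac{\hbar^2}{8m}\,\frac{2m\omega_0}{\hbar} + \tfrac{1}{2} m \omega_0^2\,\frac{\hbar}{2m\omega_0} = \tfrac{1}{2} \hbar \omega_0.$$

The uncertainty in position in the ground state is $\Delta x = s = (\hbar/2m\omega_0)^{1/2}$.

Finally, because we are dealing with a low-lying state, the uncertainty in
momentum can be roughly estimated by taking the lowest value allowed by
the uncertainty principle. This gives $\Delta p_x \simeq \hbar/2s = (\hbar m \omega_0/2)^{1/2}$.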


These estimates turn out to be exactly correct (as you will see in the next 
chapter). This lucky coincidence does not extend to other systems, but the 
method remains useful in providing order-of-magnitude estimates when 
Schrödinger’s equation is difficult to solve directly. 
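The minimization in Worked Example 4.4 can also be carried out numerically, which is how the method would be used when the calculus is less convenient. The sketch below is an illustration under stated assumptions: it works in units where $\hbar = m = \omega_0 = 1$, and it uses a golden-section search (our choice of technique, not one prescribed by the text) to minimize $f(s) = \hbar^2/8ms^2 + \frac{1}{2} m \omega_0^2 s^2$. The search interval and tolerance are arbitrary.

```python
import math

# Units chosen so that hbar = m = omega_0 = 1 (an assumption made for illustration).
HBAR = 1.0
MASS = 1.0
OMEGA0 = 1.0

def f(s):
    """Energy bound f(s) = hbar^2 / (8 m s^2) + (1/2) m omega_0^2 s^2."""
    return HBAR ** 2 / (8.0 * MASS * s ** 2) + 0.5 * MASS * OMEGA0 ** 2 * s ** 2

def golden_section_minimize(func, lo, hi, tol=1e-9):
    """Locate the minimum of a unimodal function on [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    while abs(b - a) > tol:
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if func(c) < func(d):
            b = d   # minimum lies in [a, d]
        else:
            a = c   # minimum lies in [c, b]
    return 0.5 * (a + b)

s_min = golden_section_minimize(f, 0.05, 5.0)
e_min = f(s_min)
dp_min = HBAR / (2.0 * s_min)

print(f"s at minimum   = {s_min:.6f}  (analytic: {math.sqrt(HBAR / (2 * MASS * OMEGA0)):.6f})")
print(f"minimum energy = {e_min:.6f}  (analytic: {0.5 * HBAR * OMEGA0:.6f})")
print(f"Delta p_x      = {dp_min:.6f}  (analytic: {math.sqrt(HBAR * MASS * OMEGA0 / 2):.6f})")
```

Replacing `f` by the corresponding function for another potential gives an order-of-magnitude estimate of that system's ground-state energy in the same way.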
Exercise 4.14 The classical expression for the total energy of an electron
interacting with a proton in a hydrogen atom is

$$E = \frac{p_x^2 + p_y^2 + p_z^2}{2m} - \frac{e^2}{4\pi\varepsilon_0 r},$$

where $r$ is the distance between the electron (of charge $-e$) and the proton (of
charge $e$). Taking the typical distance of the electron from the proton to be $s$, and
assuming for simplicity that $\Delta x = \Delta y = \Delta z \simeq s$, estimate the ground-state
energy and radius of a hydrogen atom.

($\varepsilon_0$ is a fundamental constant that determines the strength of electrostatic forces; it is called the permittivity of free space. The values of some important physical constants are listed inside the back cover.)


Summary of Chapter 4 


Section 4.1 In wave mechanics, the state of a system is completely specified by 
its wave function. The time-evolution of the wave function is fully predicted by 
Schrödinger’s equation. Nevertheless, quantum mechanics is indeterministic 
because the result of measuring an observable in a state described by a given wave 
function is unpredictable. The act of measurement drastically alters the state of 
the system, so the wave function after the measurement is not the same as the 
wave function before the measurement. To test the predictions of quantum 
physics, we often take a large number of identical systems all in the same state, 
and perform the same measurement on each system. 


Not everything in quantum mechanics is unpredictable. Every energy 
measurement yields one or other of the energy eigenvalues. In a stationary state 
corresponding to the energy eigenvalue $E_i$, we are certain to obtain the energy $E_i$.


Section 4.2 We consider a system with a discrete set of non-degenerate energy 
eigenvalues (a particle in a one-dimensional infinite well). If the energy of a 
system is measured at time $t$, then the probability of obtaining the $i$th energy
eigenvalue $E_i$ is given by the overlap rule

$$p_i = \left| \int_{-\infty}^{\infty} \psi_i^*(x)\, \Psi(x,t)\, dx \right|^2,$$

where $\Psi(x,t)$ is the wave function of the system at the instant of measurement,
and $\psi_i(x)$ is the energy eigenfunction with eigenvalue $E_i$. Both the wave function
and the eigenfunction are assumed to be normalized. For this rule to be consistent,
energy eigenfunctions with different eigenvalues must be orthogonal to one
another.

If, at the time of measurement, the wave function is a linear combination of
energy eigenfunctions, then the probability of obtaining the energy eigenvalue $E_i$
is given by $|c_i(t)|^2$, where $c_i(t)$ is the coefficient of the energy eigenfunction
$\psi_i(x)$ in the wave function.


These rules can be extended to other observables and systems. 
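As a concrete illustration of the overlap rule, the following Python sketch evaluates the overlap integrals numerically for a particle in a one-dimensional infinite well. The choice of state is an assumption made purely for illustration: a normalized parabolic wave function $\Psi(x) = \sqrt{30/L^5}\, x(L - x)$ on $0 \leq x \leq L$, with energy eigenfunctions $\psi_n(x) = \sqrt{2/L}\,\sin(n\pi x/L)$. All functions here are real, so the complex conjugate in the overlap rule has no effect.

```python
import math

def overlap_probabilities(n_max=7, L=1.0, n_steps=20000):
    """Energy probabilities p_n = |integral psi_n(x) Psi(x) dx|^2 by the midpoint rule."""
    dx = L / n_steps
    norm = math.sqrt(30.0 / L ** 5)   # normalizes Psi(x) = norm * x * (L - x)
    probs = []
    for n in range(1, n_max + 1):
        overlap = 0.0
        for k in range(n_steps):
            x = (k + 0.5) * dx
            psi_n = math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)
            big_psi = norm * x * (L - x)
            overlap += psi_n * big_psi * dx
        probs.append(overlap ** 2)
    return probs

probs = overlap_probabilities()
for n, p in enumerate(probs, start=1):
    print(f"p_{n} = {p:.6f}")
print(f"sum of first {len(probs)} probabilities = {sum(probs):.6f}")
```

The probabilities for even $n$ vanish because the chosen state is symmetric about the centre of the well, and the sum approaches 1 as more eigenfunctions are included.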


Section 4.3 If an observable $A$ has a discrete set of possible values, the
expectation value of $A$ in a given state $\Psi$ is defined by

$$\langle A \rangle = \sum_i p_i a_i,$$

where $p_i$ is the probability of obtaining value $a_i$ in the state $\Psi$. Expectation values
can be calculated using the sandwich integral rule

$$\langle A \rangle = \int_{-\infty}^{\infty} \Psi^*(x,t)\, \hat{A}\, \Psi(x,t)\, dx.$$

Sections 4.4 and 4.5 The uncertainty of an observable $A$ is defined by

$$\Delta A = \langle (A - \langle A \rangle)^2 \rangle^{1/2} = \left( \langle A^2 \rangle - \langle A \rangle^2 \right)^{1/2}.$$

With this definition, the Heisenberg uncertainty principle states that

$$\Delta x\, \Delta p_x \geq \frac{\hbar}{2}.$$


Achievements from Chapter 4 


After studying this chapter you should be able to: 
4.1 Explain the meanings of the newly defined (emboldened) terms and 
symbols, and use them appropriately. 


4.2 Recall that the allowed values of a discrete observable are given by its 
eigenvalues. 


4.3 Use the overlap rule and/or the coefficient rule to calculate the probability 
distribution of an observable in a given state. 


4.4 Recall that the overlap rule can be extended to an observable with a 
continuum of possible values. 


4.5 Define the expectation value of an observable in a given state. Calculate 
expectation values in simple cases. 


4.6 Define the uncertainty of an observable in a given state. Calculate 
uncertainties in simple cases. 


4.7 State and explain the meaning of the Heisenberg uncertainty principle. Use 
the uncertainty principle to make order-of-magnitude estimates. 
Chapter 5 Simple harmonic oscillators


Introduction 


Oscillations are a familiar type of motion. A guitar string, the suspension system 
of a car and a pendulum all oscillate in various ways. If the amplitude is small, it 
is generally a good approximation to suppose that the oscillation is simple 
harmonic, with a displacement that depends sinusoidally on time. Any system 
that performs motion of this sort is called a simple harmonic oscillator, or a 
harmonic oscillator for short. 


Oscillations are also common in the microscopic world. For example, a nitrogen 
molecule consists of two nitrogen atoms separated by an equilibrium distance of 
$1.09 \times 10^{-10}$ m. But molecules are not rigid. From a classical perspective, we
can picture a nitrogen molecule as stretching and compressing, producing an 
oscillation similar to that of two balls joined by a spring. Atoms in solids also 
oscillate around equilibrium positions, and some atomic nuclei wobble about 
equilibrium shapes. Moreover, sinusoidal waves are closely related to sinusoidal 
oscillations, so the quantum physics of waves, including light, is generally based 
on the behaviour of harmonic oscillators. The founders of quantum mechanics 
knew that oscillations were important. When Heisenberg took the first tentative 
steps towards quantum mechanics in 1925, the first system he tackled was the 
harmonic oscillator. 


The physics of harmonic oscillators has two aspects. Obviously, there is the 
oscillation itself — a time-dependent process in which something swings back 
and forth; in classical mechanics, we concentrate on this almost exclusively. In 
quantum mechanics, however, another issue must be tackled first. We need to 
investigate the stationary states and energy levels of the system. This requires us 
to solve the time-independent Schrödinger equation for a harmonic oscillator and
interpret its solutions.

In a stationary state $\Psi_n(x,t) = \psi_n(x)\, e^{-iE_n t/\hbar}$, the probability density
$|\Psi_n(x,t)|^2$ is independent of time. So a stationary state describes a sort of
‘suspended animation’: it has a definite energy, but it does not provide a
description of motion through space. This seems strange because we have
no direct experience of stationary states in everyday life, but the physics of
microscopic systems, such as atoms or molecules, is dominated by them.
For example, when molecules collide with one another or absorb or emit
electromagnetic radiation, their stationary-state wave functions determine the
transitions that are likely to occur. The transitions between vibrational energy
levels produce infrared spectra and influence the rates of chemical reactions; it has
also been suggested that they affect our sense of smell.


It turns out that the to-and-fro motion observed in a simple harmonic oscillator is
well described by linear combinations of stationary states. This is the main
topic of the next chapter (Chapter 6); the present chapter deals only with single
stationary states. It is organized as follows. Section 5.1 gives a brief description of
classical simple harmonic motion. Section 5.2 gives a quantum-mechanical
description; it introduces the time-independent Schrödinger equation and
describes its solutions, the energy eigenfunctions, together with the energy
eigenvalues. These eigenfunctions and eigenvalues are justified in Section 5.3,
which also introduces some useful new tools: raising and lowering operators.
Finally, in Section 5.4, we calculate expectation values and uncertainties for a
harmonic oscillator, and derive a selection rule that helps explain the spectral lines
emitted by vibrating molecules.


5.1 Classical harmonic oscillators 


It is sensible to begin by reviewing the classical physics of simple harmonic 
motion. This will allow us to introduce some terminology and provide a reference 
against which the quantum-mechanical description can be compared. 


Consider a particle of mass m that moves to and fro along the x-axis, with a 
position of stable equilibrium at the origin, x = 0. When the particle moves away 
from the origin, it feels a force pulling it back towards the origin. In simple 
harmonic motion, the force is proportional to the displacement: 


$$F_x = -Cx. \qquad (5.1)$$

This is Hooke’s law. The constant $C$ is called the force constant, and $x$ is the
displacement of the particle from the equilibrium position at $x = 0$. The minus
sign ensures that $F_x$ is a restoring force, one that pulls the particle back
towards its equilibrium position.


In classical physics, the motion is governed by Newton’s second law 


$$m \frac{d^2 x}{dt^2} = F_x = -Cx,$$

which can be rearranged to give

$$\frac{d^2 x}{dt^2} + \omega_0^2 x = 0, \qquad (5.2)$$

where

$$\omega_0 = \sqrt{\frac{C}{m}} \qquad (5.3)$$

is a constant. The solution of Equation 5.2 can be written as


$$x(t) = A \cos(\omega_0 t + \phi), \qquad (5.4)$$

which describes a sinusoidal oscillation, as illustrated in Figure 5.1. The
to-and-fro motion of the particle is shown by the fact that the graph oscillates
above and below $x = 0$. There is no damping in our model, so the oscillation
continues indefinitely, each cycle of period $T$ an exact repetition of the last.

Figure 5.1 The displacement of a particle undergoing classical simple harmonic motion.


In addition to the parameter $\omega_0$ that characterizes the oscillator, Equation 5.4
contains two arbitrary constants $A$ and $\phi$. This number of arbitrary constants is
appropriate for the general solution of a second-order differential equation. The
meaning of all these constants is as follows.

(Note that $\omega_0$ is not an arbitrary constant because it is determined by the system parameters $m$ and $C$.)

• $A$ is the amplitude of the oscillation. This is the magnitude of the maximum
displacement of the particle from its equilibrium position.

• $\phi$ is the phase constant of the oscillation. This depends on when the particle
reaches a position of maximum displacement. If the particle reaches its
maximum positive displacement at $t = 0$, then $\phi = 0$.

• $\omega_0$ is the angular frequency of the oscillation. It is related to the frequency $f$
and period $T$ by

$$\omega_0 = 2\pi f = \frac{2\pi}{T}. \qquad (5.5)$$

Figure 5.2 A simple harmonic potential energy well. The zero of potential energy is taken to be at the bottom of the well for a harmonic oscillator.

Since $\omega_0 = \sqrt{C/m}$, oscillations of high angular frequency are associated with
large restoring forces acting on particles of low mass, just as you would expect. 
The angular frequency is independent of the amplitude and the phase constant, 
and so characterizes the oscillator under study, no matter how it has been set in 
motion. This is a special feature of simple harmonic motion; it relies on the 
restoring force being exactly proportional to the displacement, and is not true 
more generally. 


Simple harmonic motion can also be analyzed in terms of energy. The potential 
energy function of a simple harmonic oscillator takes the form 

$$V(x) = \tfrac{1}{2} C x^2,$$

where $C$ is the force constant. This function is sketched in Figure 5.2. It has its
minimum value $V = 0$ at the equilibrium position, $x = 0$, and increases either
side of the equilibrium position as the displacement grows in magnitude.

The particle has momentum $p_x = m v_x$, and kinetic energy

$$E_{\text{kin}} = \tfrac{1}{2} m v_x^2 = \frac{p_x^2}{2m}.$$

Adding both contributions to the energy together, the total energy is

$$E = \frac{p_x^2}{2m} + \tfrac{1}{2} C x^2. \qquad (5.6)$$


During the oscillation, energy is continually converted
between kinetic and potential forms. When the particle is
near its equilibrium position, most of the energy is in the form
of kinetic energy. Near a point of maximum displacement,
most of the energy is in the form of potential energy. However,
the total energy always remains constant, in agreement with
the law of conservation of energy. The kinetic, potential and
total energies of a simple harmonic oscillator are illustrated
in Figure 5.3. The graphs of kinetic and potential energy are
similar in shape, but are out of phase with one another. It follows
that the average kinetic energy is equal to the average potential
energy, provided that the averages are taken over one or more
complete periods of the oscillation.

Figure 5.3 The kinetic, potential and total energies of a simple harmonic oscillator. The kinetic and potential energies interchange, but the total energy remains constant.

The total energy $E$ is related to the amplitude of the oscillation,
as you will see in the next exercise.


Exercise 5.1 A particle performs simple harmonic motion of amplitude A in a 
harmonic oscillator with force constant C. Show that its total energy is 


$$E = \tfrac{1}{2} C A^2.$$

Exercise 5.2 Use Equations 5.4 and 5.6 to verify that the total energy of a
harmonic oscillator is independent of time.
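The energy bookkeeping described in this section can be checked numerically before attempting the algebra. The following Python sketch is an illustration only: the values of $m$, $C$, $A$ and $\phi$ are arbitrary assumptions, not values taken from the text. It samples one complete period of $x(t) = A\cos(\omega_0 t + \phi)$, with the momentum obtained as $p_x = m\,dx/dt$, and compares the kinetic, potential and total energies.

```python
import math

# Arbitrary illustrative parameters (assumptions, in consistent but unspecified units).
m = 2.0      # mass
C = 8.0      # force constant
A = 0.5      # amplitude
phi = 0.3    # phase constant

omega0 = math.sqrt(C / m)        # Equation 5.3
period = 2.0 * math.pi / omega0  # Equation 5.5

def position_and_momentum(t):
    """x(t) from Equation 5.4 and p_x = m dx/dt."""
    x = A * math.cos(omega0 * t + phi)
    p = -m * A * omega0 * math.sin(omega0 * t + phi)
    return x, p

n_samples = 1000
kinetic, potential = [], []
for k in range(n_samples):
    x, p = position_and_momentum(k * period / n_samples)
    kinetic.append(p * p / (2.0 * m))
    potential.append(0.5 * C * x * x)

total = [ke + pe for ke, pe in zip(kinetic, potential)]
avg_kinetic = sum(kinetic) / n_samples
avg_potential = sum(potential) / n_samples

print(f"total energy: min {min(total):.12f}, max {max(total):.12f}")
print(f"(1/2) C A^2 = {0.5 * C * A * A:.12f}")
print(f"average kinetic energy   = {avg_kinetic:.12f}")
print(f"average potential energy = {avg_potential:.12f}")
```

A numerical check of this kind is not a proof, but it is a useful guard against algebraic slips.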


5.1.1 Diatomic molecules in classical physics 


Simple harmonic motion does not always involve the motion of a single particle 
about a fixed point. For example, molecules can vibrate internally. The simplest 
case is provided by a diatomic molecule, one consisting of two atoms. From a
classical perspective, a diatomic molecule can be pictured as two balls joined by a
spring (Figure 5.4). Several types of motion are possible: translational motion of
the molecule as a whole, rotational motion around an axis, and vibrational motion
in which the bond between the two atoms alternately stretches and compresses.
We focus on the vibrational motion here, emphasizing its relationship to the basic
description of simple harmonic motion given above.

Figure 5.4 A classical model of a diatomic molecule.


In a diatomic molecule, such as CO, we have two particles (the carbon and oxygen
atoms) which interact with one another, rather than a single particle responding to an
external force. We neglect any external forces acting on the molecule. Then,
ignoring rotational motion, and placing the x-axis along the line of separation of
the atoms, the total energy of the molecule is

$$E = \frac{p_1^2}{2m_1} + \frac{p_2^2}{2m_2} + V(x_2 - x_1), \qquad (5.7)$$

where $m_i$, $p_i$ and $x_i$ are the mass, momentum and position of atom $i$. The first
term in Equation 5.7 is the kinetic energy of atom 1, the second term is the kinetic
energy of atom 2, and the last term is the potential energy of interaction between
the two atoms, which depends only on the distance between them.

(We have omitted the subscript $x$ on $p_1$ and $p_2$ for simplicity.)


Now, a remarkable simplification can be achieved, merely by changing variables. 
We shall not spend time on the algebraic details, but just state the final result. It 
turns out that Equation 5.7 can be rewritten as 


$$E = \frac{P^2}{2M} + \frac{p^2}{2\mu} + V(x), \qquad (5.8)$$

where $M = m_1 + m_2$ and $P = p_1 + p_2$ are the total mass and total momentum of
the molecule, and the other variables are defined by

$$x = x_2 - x_1, \quad \mu = \frac{m_1 m_2}{m_1 + m_2} \quad \text{and} \quad p = \mu \frac{dx}{dt}. \qquad (5.9)$$

In Equation 5.8, the first term, $P^2/2M$, is the kinetic energy associated with
translational motion of the molecule as a whole. This term has nothing to do
with any internal vibration of the molecule, and we can set it equal to zero by
agreeing to work in the centre-of-mass frame (the reference frame where the
total momentum of the molecule is zero).


● Why is the centre-of-mass frame more suitable than a reference frame with its
origin fixed on one of the two atoms?

○ With no external force acting on the molecule, the centre of mass moves
uniformly, so it provides the origin of an inertial frame. Newton’s laws apply
in such a frame; the same cannot be said of a reference frame fixed on one of
the vibrating atoms.

In the centre-of-mass frame, the energy becomes

$$E = \frac{p^2}{2\mu} + V(x), \qquad (5.10)$$

which is just like the energy of a single particle with its potential energy provided
by a fixed external agency. The mass

$$\mu = \frac{m_1 m_2}{m_1 + m_2} \qquad (5.11)$$

is less than either $m_1$ or $m_2$, and is called the reduced mass of the molecule. So,
here is the general point to remember: 


The internal motion of a two-particle system can be analyzed by considering 
the motion of a single particle with the reduced mass. 


If $m_1 \ll m_2$, the reduced mass is only slightly less than $m_1$, leading to a small
correction in the results that would be obtained by (wrongly) taking the more
massive particle to be permanently at rest. In a molecule such as N$_2$, however, the
reduced mass is only half the mass of either atom.
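The change of variables leading to the reduced mass can be checked numerically. The sketch below is an illustration under stated assumptions: masses are given in atomic mass units using rounded mass numbers (12 and 16 for the carbon and oxygen atoms in CO, 14 for nitrogen), the velocities are arbitrary, and the relative momentum is taken to be $p = \mu(v_2 - v_1)$, the reduced mass times the rate of change of the separation. It confirms that the kinetic energy of the two atoms equals the sum of a translational term $P^2/2M$ and an internal term $p^2/2\mu$.

```python
def reduced_mass(m1, m2):
    """Reduced mass mu = m1 m2 / (m1 + m2)."""
    return m1 * m2 / (m1 + m2)

def kinetic_two_particles(m1, m2, v1, v2):
    """Kinetic energy written as a sum over the two atoms."""
    return 0.5 * m1 * v1 ** 2 + 0.5 * m2 * v2 ** 2

def kinetic_split(m1, m2, v1, v2):
    """Return (translational, internal) kinetic energies P^2/2M and p^2/2mu."""
    total_mass = m1 + m2
    total_momentum = m1 * v1 + m2 * v2
    mu = reduced_mass(m1, m2)
    p_rel = mu * (v2 - v1)   # assumed definition: mu times d(x2 - x1)/dt
    return total_momentum ** 2 / (2.0 * total_mass), p_rel ** 2 / (2.0 * mu)

# Rounded mass numbers in atomic mass units; velocities are arbitrary test values.
m_c, m_o = 12.0, 16.0
v_c, v_o = 0.3, -0.7

direct = kinetic_two_particles(m_c, m_o, v_c, v_o)
translational, internal = kinetic_split(m_c, m_o, v_c, v_o)

print(f"direct sum over atoms     = {direct:.6f}")
print(f"translational + internal  = {translational + internal:.6f}")
print(f"reduced mass of CO        = {reduced_mass(m_c, m_o):.4f} u")
print(f"reduced mass of N2        = {reduced_mass(14.0, 14.0):.4f} u")
print(f"light atom on heavy atom  = {reduced_mass(1.0, 1000.0):.6f} (m1 = 1, m2 = 1000)")
```

The last line illustrates the limiting case in which one particle is much lighter than the other: the reduced mass is then only slightly less than the smaller mass.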


Finally, let us consider the form of the potential energy function.
Figure 5.5 shows a graph of potential energy versus separation
for two atoms in a typical diatomic molecule. The minimum
in this graph corresponds to the equilibrium separation of the
atoms. The potential energy rises either side of the equilibrium
separation as the atoms resist being squashed too closely together
or pulled too far apart.

Near the minimum at $x = x_0$, the potential energy function
can be approximated by

$$V(x) \simeq V(x_0) + \tfrac{1}{2} C (x - x_0)^2. \qquad (5.12)$$

With a suitable choice for the origin of coordinates and zero
of potential energy (indicated by the dashed set of axes in
Figure 5.5), this is the potential energy of a harmonic oscillator.
The approximation is good for oscillations of relatively small
amplitude; under these circumstances, we can say that a vibrating
molecule undergoes simple harmonic oscillations, with a bond
length that varies sinusoidally in time.

Figure 5.5 Solid curve: the potential energy of interaction for atoms in a typical diatomic molecule, plotted against their separation. Dashed curve: an approximation valid near the potential energy minimum; referred to the dashed axes, this is the potential energy function of a harmonic oscillator.


Exercise 5.3 A hydrogen molecule consists of two hydrogen
atoms, each of mass $1.67 \times 10^{-27}$ kg. The molecule has vibration
frequency $f = 1.25 \times 10^{14}$ Hz. If the force between the two
hydrogen atoms can be modelled by Hooke’s law, what is
the value of the force constant, $C$?


5.2 Quantum harmonic oscillators 


5.2.1 Schrödinger’s equation


We now turn to the quantum-mechanical description of a harmonic oscillator. It is
easy to construct Schrödinger’s equation for this system, following the usual
three-step recipe of Chapter 2: write down the classical Hamiltonian function,
replace variables by operators, and then assemble Schrödinger’s equation.

The first step is already complete because Equation 5.6 is the total energy of the
system expressed in terms of momentum, so it is the classical Hamiltonian
function. Applying the conversion rule

$$\frac{p_x^2}{2m} \Longrightarrow -\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2},$$

we obtain the Hamiltonian operator

$$\hat{H} = -\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2} + \tfrac{1}{2} C x^2.$$

It is convenient to express this in terms of the variable $\omega_0 = \sqrt{C/m}$ introduced in
Equation 5.3. This gives

$$\hat{H} = -\frac{\hbar^2}{2m} \frac{\partial^2}{\partial x^2} + \tfrac{1}{2} m \omega_0^2 x^2.$$

(We shall call $\omega_0$ the classical angular frequency, to show that we are not prejudging its quantum-mechanical significance.)

Finally, inserting the Hamiltonian operator in the general form of Schrödinger’s
equation, $i\hbar\, \partial\Psi/\partial t = \hat{H}\Psi$, we obtain

$$i\hbar \frac{\partial \Psi}{\partial t} = -\frac{\hbar^2}{2m} \frac{\partial^2 \Psi}{\partial x^2} + \tfrac{1}{2} m \omega_0^2 x^2\, \Psi(x,t). \qquad (5.13)$$


This is Schrödinger’s equation for a harmonic oscillator. It tells us how any wave
function describing the state of the oscillator evolves in time. However, we are
interested in the energy levels of the oscillator, so we shall focus on states that
have a definite energy: the stationary states. A stationary-state wave function
takes the form

$$\Psi(x,t) = \psi(x)\, e^{-iEt/\hbar}, \qquad (5.14)$$

where the energy eigenfunction $\psi(x)$ satisfies the time-independent Schrödinger
equation for a harmonic oscillator:

$$-\frac{\hbar^2}{2m} \frac{d^2 \psi}{dx^2} + \tfrac{1}{2} m \omega_0^2 x^2\, \psi(x) = E\, \psi(x). \qquad (5.15)$$

Exercise 5.4 By substituting Equation 5.14 into Equation 5.13, show that
$\psi(x)$ satisfies Equation 5.15.


The time-independent Schrödinger equation is the eigenvalue equation for energy.
It is accompanied by the usual requirement that its solutions should not diverge as
$x \to \pm\infty$. If we pick an arbitrary value of $E$, and try to find a function $\psi(x)$ that
satisfies Equation 5.15, and does not diverge at infinity, we will almost certainly
fail. Only very special values of $E$ allow this to happen. The special values of $E$
that permit non-divergent solutions of Equation 5.15 are the energy eigenvalues of
the harmonic oscillator. These energy eigenvalues give the possible energies of
the oscillator. They form a discrete set, so the quantum harmonic oscillator has a
discrete set of energy levels, which we shall label as $E_0, E_1, E_2, \ldots$.
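The claim that an arbitrary value of $E$ almost always leads to a divergent solution can be explored numerically. The following Python sketch is an illustration under stated assumptions, not the method used later in this chapter. It measures position in units of $(\hbar/m\omega_0)^{1/2}$ and energy in units of $\hbar\omega_0$, so that the time-independent Schrödinger equation becomes $d^2\psi/d\xi^2 = (\xi^2 - 2\epsilon)\psi$. It integrates outwards from $\xi = 0$ with a fourth-order Runge-Kutta scheme, starting from $\psi = 1$ and $d\psi/d\xi = 0$ (so only solutions that are even in $\xi$ are considered), and looks at the value reached at $\xi = 5$. A bisection on the sign of that value then homes in on energies for which the solution does not blow up. The function names, step counts and brackets are our own choices.

```python
def shoot(eps, xi_max=5.0, n_steps=2000):
    """Integrate psi'' = (xi^2 - 2 eps) psi from xi = 0 with psi(0) = 1, psi'(0) = 0.

    Returns psi(xi_max), computed with the classical fourth-order Runge-Kutta method.
    """
    h = xi_max / n_steps
    xi, psi, dpsi = 0.0, 1.0, 0.0

    def second_derivative(x, y):
        return (x * x - 2.0 * eps) * y

    for _ in range(n_steps):
        k1y, k1v = dpsi, second_derivative(xi, psi)
        k2y, k2v = dpsi + 0.5 * h * k1v, second_derivative(xi + 0.5 * h, psi + 0.5 * h * k1y)
        k3y, k3v = dpsi + 0.5 * h * k2v, second_derivative(xi + 0.5 * h, psi + 0.5 * h * k2y)
        k4y, k4v = dpsi + h * k3v, second_derivative(xi + h, psi + h * k3y)
        psi += h * (k1y + 2.0 * k2y + 2.0 * k3y + k4y) / 6.0
        dpsi += h * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        xi += h
    return psi

def find_even_eigenvalue(lo, hi, iterations=40):
    """Bisect on the sign of psi(xi_max); assumes the sign differs at lo and hi."""
    f_lo = shoot(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = shoot(mid)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)

for eps in (0.4, 0.5, 0.6):
    print(f"E = {eps:.1f} hbar*omega_0  ->  psi(5) = {shoot(eps):.3e}")

eps_first = find_even_eigenvalue(0.4, 0.6)
eps_second = find_even_eigenvalue(2.4, 2.6)
print(f"lowest even-state energy      = {eps_first:.6f} hbar*omega_0")
print(f"next even-state energy        = {eps_second:.6f} hbar*omega_0")
```

Energies slightly below and slightly above a special value send the solution off towards large values of opposite sign, which is why the bisection works. Solutions that are odd in $\xi$ could be treated in the same way by starting from $\psi = 0$ and $d\psi/d\xi = 1$.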


(Note that the numbering starts from 0 in this case.)

Each energy eigenvalue $E_i$ has a corresponding energy eigenfunction, $\psi_i(x)$, and
a stationary-state wave function

$$\Psi_i(x,t) = \psi_i(x)\, e^{-iE_i t/\hbar}.$$

This wave function describes a state in which the oscillator is certain to have
energy $E_i$. The value of the stationary-state wave function at time $t = 0$ is equal
to the energy eigenfunction, $\psi_i(x)$. Both the wave function and the eigenfunction
are normalized:

$$\int_{-\infty}^{+\infty} \Psi_i^*(x,t)\, \Psi_i(x,t)\, dx = \int_{-\infty}^{+\infty} \psi_i^*(x)\, \psi_i(x)\, dx = 1,$$

expressing the fact that the particle is certain to be found somewhere.

Figure 5.6 The first four energy levels of a harmonic oscillator, at the bottom of an infinite ladder of equally spaced energy levels. The levels shown are $E_0 = \frac{1}{2}\hbar\omega_0$, $E_1 = \frac{3}{2}\hbar\omega_0$, $E_2 = \frac{5}{2}\hbar\omega_0$ and $E_3 = \frac{7}{2}\hbar\omega_0$, with spacing $\hbar\omega_0$.


5.2.2 Eigenvalues and eigenfunctions 


To find the energy eigenfunctions and eigenvalues of a harmonic oscillator, we 
need to solve the time-independent Schrödinger equation for a harmonic oscillator 
(Equation 5.15). From the point of view of physical principles, this is similar to 
the task tackled in Chapter 3, where the time-independent Schrödinger equation 
was solved for an infinite square well. The only difference is one of mathematical 
difficulty: we have a tougher nut to crack in this case. We will therefore begin by 
describing the solutions that emerge. Later, in Section 5.3, you will see how these 
solutions can be obtained from Equation 5.15. 


Energy eigenvalues 
The energy eigenvalues of a harmonic oscillator turn out to be 


$$E_n = \left( n + \tfrac{1}{2} \right) \hbar \omega_0 \quad \text{for } n = 0, 1, 2, \ldots, \qquad (5.16)$$

where $\omega_0 = \sqrt{C/m}$ is the classical angular frequency of the oscillator. These are
the energy levels of the oscillator. As shown in Figure 5.6, the energy levels are
equally spaced, starting with the lowest energy $\hbar\omega_0/2$, and with a constant
spacing of $\hbar\omega_0$ between neighbouring levels.


In classical physics, the oscillating particle can come to rest at the bottom of the 
well, and can therefore have zero energy. By contrast, the lowest possible energy 
of a particle in a quantum harmonic oscillator is $E_0 = \hbar\omega_0/2$. This is called the
zero-point energy of the oscillator because it persists even at the absolute zero of
temperature, where one might expect the oscillator to be completely calm.
Zero-point energy is not unexpected (you saw in Chapter 4 that all confined
particles have some energy as a consequence of the uncertainty principle), but
we now see the zero-point energy emerging directly from the energy eigenvalues.
Note that the lowest energy level, $E_0$, is assigned quantum number $n = 0$. This is
unlike the convention adopted for a square well, so you will need to remember
this fact.


The pattern of equally-spaced energy levels shown in Figure 5.6 is a hallmark of a 
harmonic oscillator. In a one-dimensional infinite square well, the energy levels 
get further apart as the energy increases; in an un-ionized hydrogen atom, the 
energy levels get closer together as the energy increases, but the harmonic 
oscillator has an infinite ladder of equally-spaced energy levels; it is unique in this 
respect. 
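The three spacing patterns just described can be compared side by side. The sketch below is an illustration only. It uses Equation 5.16 for the oscillator, the infinite-square-well result $E_n \propto n^2$ from Chapter 3, and the standard bound-state formula $E_n \propto -1/n^2$ for hydrogen (quoted here as a known result rather than derived in this chapter). Each ladder is expressed in its own natural energy unit, so only the pattern of the spacings is meaningful, not a comparison of absolute values.

```python
def oscillator_levels(count):
    """E_n = (n + 1/2) in units of hbar*omega_0, for n = 0, 1, 2, ..."""
    return [n + 0.5 for n in range(count)]

def square_well_levels(count):
    """E_n = n^2 in units of the ground-state energy, for n = 1, 2, 3, ..."""
    return [float(n * n) for n in range(1, count + 1)]

def hydrogen_levels(count):
    """E_n = -1/n^2 in units of the ground-state binding energy, for n = 1, 2, 3, ..."""
    return [-1.0 / (n * n) for n in range(1, count + 1)]

def spacings(levels):
    """Differences between neighbouring energy levels."""
    return [upper - lower for lower, upper in zip(levels, levels[1:])]

osc_gaps = spacings(oscillator_levels(6))
well_gaps = spacings(square_well_levels(6))
hyd_gaps = spacings(hydrogen_levels(6))

print("harmonic oscillator :", [f"{g:.3f}" for g in osc_gaps])
print("infinite square well:", [f"{g:.3f}" for g in well_gaps])
print("hydrogen atom       :", [f"{g:.3f}" for g in hyd_gaps])
```

Only the oscillator produces the same gap at every rung of the ladder.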


In practice, of course, a harmonic oscillator is an idealization — stretch any 
rubber band far enough, and it will break. However, we know that the atoms in a 
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diatomic molecule interact via a potential energy function that is approximately AB 
harmonic near the bottom of the well. It is therefore interesting to ask whether 
real diatomic molecules have vibrational energy levels that are similar to those of 
a harmonic oscillator. Figure 5.7 shows the vibrational energy levels of a carbon 
monoxide molecule (CO) inferred from spectral lines observed in the infrared part 
of the spectrum. These energy levels are almost equally spaced, especially 

near the bottom of the well, but become slightly more closely spaced as the 
energy increases. In this case, a harmonic oscillator model provides a reasonable — n=l] 
description of at least the first twenty-one vibrational energy levels. 


0.200V | 


It is worth commenting on the relationship between the energy level diagram in 
Figure 5.7 and the spectral lines emitted by a CO molecule. At first sight, you 
might suppose that a molecule in a high energy level could jump to any lower 
energy level by emitting a photon of energy 


$$\hbar\omega = E_i - E_j,$$

where $E_i$ is the initial energy level, $E_j$ is the final energy level and $\omega$ is the
angular frequency of the emitted light.


However, the observed spectrum is much simpler than this. Ignoring very faint
lines, the vibrational spectrum is produced by transitions between neighbouring
energy levels: between $E_{10}$ and $E_9$, for example, but not between $E_{10}$
and $E_7$. A rule of this kind is called a selection rule because it selects the
observed spectral lines from a wider set of possibilities. The selection rule we
have just quoted is specific to vibrational energy levels. We shall explain why it is
true at the end of this chapter.


The ground-state energy eigenfunction 


Each energy eigenvalue $E_i$ is associated with an energy eigenfunction $\psi_i(x)$. We
begin by considering the eigenfunction $\psi_0(x)$, corresponding to the ground-state
energy, $E_0 = \hbar\omega_0/2$. This eigenfunction turns out to have the form

$$\psi_0(x) = C_0\, e^{-x^2/2a^2}, \tag{5.17}$$

where $a$ is a positive constant and $C_0$ is a constant of normalization. Functions of
this type are called Gaussian functions, named after the great mathematician
Carl Friedrich Gauss (1777-1855) who used them in his work on statistics. The
function $\psi_0(x)$ is plotted in Figure 5.8 on page 133. This bell-shaped curve has no
nodes, it decreases either side of a maximum value at $x = 0$, and it becomes very
small for $|x| > 3a$.

Figure 5.7 The first 21 vibrational energy levels of a CO molecule are almost, but not
quite, equally spaced.


We can verify that $\psi_0(x)$ is an energy eigenfunction of the harmonic oscillator by
substituting it into the time-independent Schrödinger equation. The following
worked example shows how this is done. As a bonus, we shall confirm that the
corresponding eigenvalue is $\hbar\omega_0/2$ and find a suitable value for the constant $a$.


Worked Example 5.1

Essential skill: Verifying that a given function is a solution of the
time-independent Schrödinger equation

Show that the function $\psi_0(x) = C_0\, e^{-x^2/2a^2}$ is an energy eigenfunction for
a harmonic oscillator, provided that the positive constant $a$ is chosen
appropriately. Find a suitable value for $a$, and determine the energy
eigenvalue corresponding to this eigenfunction.

Solution

We need to substitute the given function into the time-independent
Schrödinger equation to see whether, under suitable circumstances, it
satisfies this equation; these circumstances will give the value of $a$.

Schrödinger's equation involves the second derivative of the eigenfunction,
so we must find the second derivative of $\psi_0(x)$. Differentiating once gives

$$\frac{d\psi_0}{dx} = \frac{d}{dx}\left(C_0\, e^{-x^2/2a^2}\right) = -\frac{x}{a^2}\, C_0\, e^{-x^2/2a^2}.$$

Differentiating again gives

$$\frac{d^2\psi_0}{dx^2} = \frac{d}{dx}\left(-\frac{x}{a^2}\, C_0\, e^{-x^2/2a^2}\right) = \left(-\frac{1}{a^2} + \frac{x^2}{a^4}\right) C_0\, e^{-x^2/2a^2}.$$

Substituting $\psi_0(x)$ into the time-independent Schrödinger equation
(Equation 5.15) and cancelling the common factor $C_0\, e^{-x^2/2a^2}$ on both sides,
we obtain

$$-\frac{\hbar^2}{2m}\left(-\frac{1}{a^2} + \frac{x^2}{a^4}\right) + \frac{1}{2} m\omega_0^2 x^2 = E. \tag{5.18}$$

Because this equation is valid for all $x$, we can extract two separate
conditions from it. First, setting $x = 0$ gives

$$E = \frac{\hbar^2}{2ma^2}.$$

Then, using this result for $E$, and setting $x \neq 0$ in Equation 5.18, we obtain

$$a^4 = \frac{\hbar^2}{m^2\omega_0^2},$$

so a suitable positive value of $a$ is

$$a = \sqrt{\frac{\hbar}{m\omega_0}}.$$

With this choice of $a$, the above calculation confirms that
$\psi_0(x) = C_0\, e^{-x^2/2a^2}$ is an energy eigenfunction of the harmonic oscillator,
and it also shows that the corresponding eigenvalue is

$$E = \frac{\hbar^2}{2ma^2} = \frac{1}{2}\hbar\omega_0.$$
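The result of Worked Example 5.1 can also be checked numerically. The sketch below is only an illustration, not part of the original argument: it assumes units in which $\hbar = m = \omega_0 = 1$ (so $a = 1$), estimates the second derivative with a central difference, and evaluates the ratio $(\hat{H}\psi_0)/\psi_0$ at a few sample points. The function names and the step size are arbitrary choices made here.

```python
import math

# Illustrative units (an assumption): hbar = m = omega_0 = 1, hence a = 1.
HBAR = M = OMEGA0 = 1.0
A = math.sqrt(HBAR / (M * OMEGA0))

def psi0(x):
    """Unnormalized trial function exp(-x^2 / 2a^2)."""
    return math.exp(-x * x / (2 * A * A))

def local_energy(x, h=1e-4):
    """Return (H psi0)(x) / psi0(x), using a central-difference second derivative."""
    second_derivative = (psi0(x + h) - 2 * psi0(x) + psi0(x - h)) / (h * h)
    kinetic = -HBAR**2 / (2 * M) * second_derivative
    potential = 0.5 * M * OMEGA0**2 * x * x * psi0(x)
    return (kinetic + potential) / psi0(x)

sample_points = (-2.0, -0.5, 0.0, 1.0, 2.5)
energies = [local_energy(x) for x in sample_points]
for x, e in zip(sample_points, energies):
    print(f"x = {x:5.2f}   (H psi0)/psi0 = {e:.6f}")
```

If $\psi_0$ is an eigenfunction, the printed ratio should be the same at every point and close to $\hbar\omega_0/2 = 0.5$ in these units, up to the small error of the finite-difference estimate.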


The ground-state eigenfunction still needs to be normalized. This involves finding
a suitable value of the normalization constant $C_0$.

Exercise 5.5 Use a standard integral (given inside the back cover) to show that
a suitable value of the normalization constant in Equation 5.17 is

$$C_0 = \frac{1}{(\sqrt{\pi}\, a)^{1/2}}.$$

(Do not confuse the normalization constant $C_0$ with the force constant $C = m\omega_0^2$.)

To summarize, the normalized energy eigenfunction corresponding to the
ground-state energy $E_0$ is

$$\psi_0(x) = \left(\frac{1}{\sqrt{\pi}\, a}\right)^{1/2} e^{-x^2/2a^2}, \tag{5.19}$$

where

$$a = \sqrt{\frac{\hbar}{m\omega_0}}. \tag{5.20}$$

The constant $a$ is the only combination of $\hbar$, $m$ and $\omega_0$ (devoid of numerical
factors) that has the dimensions of length. It can therefore be regarded as a
characteristic length for the oscillator, setting a length scale for its quantum
properties. We shall call $a$ the length parameter of the oscillator.

Figure 5.8 The ground-state energy eigenfunction of a harmonic oscillator is a
Gaussian function.

Exercise 5.6


(a) Show that a has the dimensions of length. 


(b) Show that a classical harmonic oscillator, with an energy equal to the
ground-state energy $E_0 = \hbar\omega_0/2$, has an amplitude equal to $a$.


Exercise 5.6b prompts a comparison between classical and quantum physics.
If classical physics were valid, a harmonic oscillator with energy $E_0$ would
have amplitude $a$, and the oscillating particle would be confined to the region
$-a < x < a$. The particle would not have enough energy to stray any further from
the origin. But this is not what is found in quantum physics. Figure 5.8 shows that
the energy eigenfunction $\psi_0(x)$ extends beyond the region $-a < x < a$, so there
is some chance of finding the particle in the classically forbidden region $|x| > a$.
This is an example of barrier penetration, a phenomenon you first met for a
finite square well. You should appreciate that barrier penetration does not always
involve the motion of a single particle. In the case of a diatomic molecule, it
allows the two atoms to be further apart, or closer together, than would be
possible for classical particles with the same energy.


Exercise 5.7 A particle is in the ground state of a harmonic oscillator. What is
the probability of finding it in the classically forbidden region $|x| > a$?

(You may find the definite integral $\int_1^\infty e^{-x^2}\,dx = 0.139$ useful.)
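As a numerical cross-check of the integral quoted in Exercise 5.7, the following sketch uses the complementary error function from Python's standard library. It assumes the normalized ground state of Equation 5.19 and the substitution $u = x/a$, so that $P(|x| > a) = (2/\sqrt{\pi})\int_1^\infty e^{-u^2}\,du$; the function names are illustrative only.

```python
import math

def gaussian_tail():
    """Integral of exp(-u^2) from 1 to infinity, via erfc(1) = (2/sqrt(pi)) * integral."""
    return math.sqrt(math.pi) / 2 * math.erfc(1.0)

def forbidden_probability():
    """P(|x| > a) for the ground state: both tails of |psi_0|^2, with u = x/a."""
    return 2 * gaussian_tail() / math.sqrt(math.pi)

print(f"integral from 1 to infinity of exp(-u^2): {gaussian_tail():.3f}")
print(f"probability of |x| > a:                   {forbidden_probability():.3f}")
```

The first number reproduces the quoted value of 0.139, and the second shows that the particle is found outside the classical region roughly 16% of the time.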


Higher energy eigenfunctions 


Finally, let us briefly survey the other energy eigenfunctions. This is mainly 
for reference purposes: you will not need to remember the details, but you 
should be aware of the general pattern that emerges for the eigenfunctions 
$\psi_0(x)$, $\psi_1(x)$, $\psi_2(x)$, ..., corresponding to the eigenvalues $E_0$, $E_1$, $E_2$, ....

It turns out that all the energy eigenfunctions in a harmonic oscillator are of the
form

constant $\times$ [polynomial in $x$] $\times$ [Gaussian function $e^{-x^2/2a^2}$].

In the ground state, the polynomial is equal to 1, but higher energy eigenfunctions
have increasingly complicated polynomials. The Gaussian function, however,
always remains the same.

The polynomials that appear in the harmonic oscillator energy eigenfunctions are
of a kind investigated by mathematicians in the nineteenth century. They are
called Hermite polynomials, in honour of the French mathematician Charles
Hermite (1822-1901). For this reason, the energy eigenfunctions are usually
written as

$$\psi_n(x) = C_n\, H_n(x/a)\, e^{-x^2/2a^2}, \tag{5.21}$$

where $H_n(x/a)$ is the $n$th Hermite polynomial in $x/a$, and $C_n$ is a normalization
constant. (Do not confuse the Hermite polynomial $H_n$ with the Hamiltonian
operator $\hat{H}$.) The first few Hermite polynomials and normalization constants are
listed in Table 5.1. In general, $H_n(x/a)$ is a polynomial of order $n$, so the highest
power of $x/a$ that appears in it is $(x/a)^n$. Only even powers of $x/a$ are present
when $n$ is even, and only odd powers when $n$ is odd.


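Table 5.1 The first few Hermite polynomials,
$H_n(x/a)$, and normalization constants, $C_n$.

n    $H_n(x/a)$                $C_n$
0    $1$                       $\left(\frac{1}{\sqrt{\pi}\,a}\right)^{1/2}$
1    $2(x/a)$                  $\left(\frac{1}{2\sqrt{\pi}\,a}\right)^{1/2}$
2    $4(x/a)^2 - 2$            $\left(\frac{1}{8\sqrt{\pi}\,a}\right)^{1/2}$
3    $8(x/a)^3 - 12(x/a)$      $\left(\frac{1}{48\sqrt{\pi}\,a}\right)^{1/2}$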


Figure 5.9 The first four energy eigenfunctions of a harmonic oscillator.

Using Equation 5.21 and Table 5.1, the first few energy eigenfunctions are plotted
in Figure 5.9. Notice the general pattern: the eigenfunctions are even or odd,
depending on whether the quantum number $n$ is even or odd. The ground state
$\psi_0(x)$ has no nodes, and each successive eigenfunction gains one additional
node, so $\psi_n(x)$ has $n$ nodes. Although not obvious from the figure, the energy
eigenfunctions form an orthonormal set, so

$$\int_{-\infty}^{\infty} \psi_i^*(x)\, \psi_j(x)\, dx = \delta_{ij} = \begin{cases} 1 & \text{if } i = j, \\ 0 & \text{if } i \neq j. \end{cases} \tag{5.22}$$

Each of the energy eigenfunctions exhibits the phenomenon of barrier penetration
mentioned earlier. For an eigenfunction with quantum number $n$, the energy is
$E_n = (n + \tfrac{1}{2})\hbar\omega_0$ and the classical amplitude of oscillation is

$$a_n = \sqrt{2n + 1}\; a. \tag{5.23}$$

These amplitudes are marked on Figure 5.9, and you can see that each energy
eigenfunction extends into the classically forbidden region, $|x| > a_n$.
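The node counting and orthonormality described above can be explored numerically. The sketch below is an illustration under stated assumptions: it works in the dimensionless variable $\xi = x/a$ with $a = 1$, builds the Hermite polynomials from the standard recurrence $H_{k+1}(\xi) = 2\xi H_k(\xi) - 2k H_{k-1}(\xi)$ (a well-known result that is not derived in this chapter), and uses the general normalization constant $C_n = (2^n n! \sqrt{\pi}\, a)^{-1/2}$ that appears in Equation 5.43. The grid sizes and function names are arbitrary choices.

```python
import math

def hermite(n, xi):
    """Physicists' Hermite polynomial H_n(xi) from the three-term recurrence."""
    h_prev, h = 0.0, 1.0                      # H_{-1} taken as 0, H_0 = 1
    for k in range(n):
        h_prev, h = h, 2.0 * xi * h - 2.0 * k * h_prev
    return h

def psi(n, xi):
    """Normalized eigenfunction psi_n in units where a = 1 (xi = x/a)."""
    c_n = 1.0 / math.sqrt(2.0**n * math.factorial(n) * math.sqrt(math.pi))
    return c_n * hermite(n, xi) * math.exp(-xi * xi / 2.0)

def overlap(i, j, lim=12.0, steps=4000):
    """Trapezoidal estimate of the integral of psi_i * psi_j over [-lim, lim]."""
    h = 2 * lim / steps
    total = 0.0
    for k in range(steps + 1):
        weight = 0.5 if k in (0, steps) else 1.0
        xi = -lim + k * h
        total += weight * psi(i, xi) * psi(j, xi)
    return total * h

def count_nodes(n, lim=6.0, steps=1200):
    """Count sign changes of psi_n on a grid, skipping exact zeros."""
    signs = []
    for k in range(steps + 1):
        value = psi(n, -lim + k * 2 * lim / steps)
        if value != 0.0:
            signs.append(value > 0)
    return sum(1 for s, t in zip(signs, signs[1:]) if s != t)

for n in range(4):
    row = "  ".join(f"{overlap(n, m):6.3f}" for m in range(4))
    print(f"n = {n}: nodes = {count_nodes(n)}, overlaps with psi_0..psi_3 = {row}")
```

Each row of overlaps should be close to a row of the identity matrix, in line with Equation 5.22, and the node count should equal $n$.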


In classical physics, an oscillating particle moves quickly through its equilibrium 
position, and more slowly near the points of maximum displacement, where it 
comes momentarily to rest. It follows that a classical oscillator spends more of its 
time near points of maximum displacement than near the equilibrium position. It 
is interesting to ask if a similar feature is found in quantum physics. Figure 5.10
plots the probability density functions, $|\psi_n(x)|^2$, for the ground state ($n = 0$) and
a highly excited state ($n = 20$). The ground state defies our classical expectations,
but the highly excited state is in broad agreement with the classical picture. It is 
often the case that quantum-mechanical predictions approach classical predictions 
in the limit of high quantum numbers. This is not altogether surprising, because 
familiar objects, which obey classical physics, do have very high quantum 
numbers. The inventors of quantum mechanics, especially Bohr and Heisenberg, 
were strongly guided by this general idea, and dignified it with a title — the 
correspondence principle. 


Note: to obtain Equation 5.23, refer to Exercise 5.1 and recall
that $E_n = (n + \tfrac{1}{2})\hbar\omega_0$ and $C = m\omega_0^2$.

Figure 5.10 The probability density $|\psi_n|^2$ plotted for two energy
eigenfunctions of a harmonic oscillator: (a) $n = 0$; (b) $n = 20$. The dashed line
shows the proportion of time spent by a classical harmonic oscillator in each
region, averaged over a complete cycle.


5.3 Solving the time-independent Schrödinger equation


This section will justify the statements made in the preceding section about the 
energy eigenvalues and eigenfunctions of a harmonic oscillator. In the course of 
doing so, it will introduce two new operators, called the raising operator and the 
lowering operator. These operators turn out to be powerful tools, which will help 
us simplify many calculations for harmonic oscillators. It will not matter too 
much if you forget the details of the derivations given in this section, but it is 
vitally important for you to understand how raising and lowering operators are 
used. The derivations are important in illustrating this. 


5.3.1 Choosing appropriate variables 


A first step towards solving the time-independent Schrödinger equation is to write 
it in a compact form by collecting together some of the constants that appear in it. 


Recalling that $a = \sqrt{\hbar/m\omega_0}$, we have

$$\frac{\hbar^2}{2m} = \frac{\hbar\omega_0}{2}\left(\frac{\hbar}{m\omega_0}\right) = \frac{\hbar\omega_0}{2}\, a^2$$

and

$$\frac{1}{2} m\omega_0^2 = \frac{\hbar\omega_0}{2}\left(\frac{m\omega_0}{\hbar}\right) = \frac{\hbar\omega_0}{2}\, \frac{1}{a^2}.$$

These two relationships allow us to write the time-independent Schrödinger
equation for a harmonic oscillator (Equation 5.15) in the form

$$\hat{H}\, \psi_n(x) = E_n\, \psi_n(x),$$

where the Hamiltonian operator can be written as

$$\hat{H} = \frac{1}{2}\left(\frac{x^2}{a^2} - a^2 \frac{\partial^2}{\partial x^2}\right)\hbar\omega_0. \tag{5.24}$$

This consists of two terms: the first (proportional to $x^2$) is the potential energy
term and the second (proportional to $\partial^2/\partial x^2$) is the kinetic energy term. For
generality, we have used partial derivatives, but these can always be replaced by
ordinary derivatives when $\hat{H}$ acts on a function that depends only on $x$, such as
$\psi_n(x)$.

This way of writing the Hamiltonian operator is convenient because both
$x$ and $a$ have the dimensions of length, so the term in round brackets is
dimensionless, while the factor $\hbar\omega_0$ has the dimensions of energy. This shows that
a quantum-mechanical harmonic oscillator is characterized by a length, $a$, and an
energy, $\hbar\omega_0$. It is hardly surprising that the energy eigenfunctions are all functions
of $x/a$, and the energy eigenvalues are all multiples of $\hbar\omega_0$.


5.3.2 Factorizing the Hamiltonian operator 


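The next step towards solving the time-independent Schrödinger equation is based
on the observation that the term in round brackets in Equation 5.24 looks like a
difference between two squares. In ordinary algebra, we know that any difference
between two squares can be factorized: if $m$ and $n$ are two numbers, we have

$$m^2 - n^2 = (m - n)(m + n).$$

Equation 5.24 appears to involve a similar expression; instead of numbers we
now have operators that act on functions, but we can ask whether a similar
factorization can be achieved. To investigate this, consider

$$\left(\frac{x}{a} - a\frac{\partial}{\partial x}\right)\left(\frac{x}{a} + a\frac{\partial}{\partial x}\right) f(x), \tag{5.25}$$

where $f(x)$ is any reasonable (i.e. smooth) function of $x$. Multiplying out the
brackets gives

$$\frac{x^2}{a^2} f(x) - a^2 \frac{\partial^2 f}{\partial x^2} - \frac{\partial}{\partial x}\bigl(x f(x)\bigr) + x \frac{\partial f}{\partial x}. \tag{5.26}$$

Notice that the third term requires us to differentiate the product $x f(x)$. Carrying
out this differentiation and cancelling terms, we obtain

$$\left(\frac{x}{a} - a\frac{\partial}{\partial x}\right)\left(\frac{x}{a} + a\frac{\partial}{\partial x}\right) f(x) = \frac{x^2}{a^2} f(x) - a^2 \frac{\partial^2 f}{\partial x^2} - f(x).$$

Hence

$$\left(\frac{x^2}{a^2} - a^2 \frac{\partial^2}{\partial x^2}\right) f(x) = \left[\left(\frac{x}{a} - a\frac{\partial}{\partial x}\right)\left(\frac{x}{a} + a\frac{\partial}{\partial x}\right) + 1\right] f(x).$$

This is true for all smooth functions $f(x)$, so we can express the result as an
identity between operators. Including the factor of 1/2 that appears in the
Hamiltonian, we have

$$\frac{1}{2}\left(\frac{x^2}{a^2} - a^2 \frac{\partial^2}{\partial x^2}\right) = \frac{1}{2}\left(\frac{x}{a} - a\frac{\partial}{\partial x}\right)\left(\frac{x}{a} + a\frac{\partial}{\partial x}\right) + \frac{1}{2}. \tag{5.27}$$

You can see that our attempt at factorization almost works, but there is a
remainder of 1/2. We can also take the two factors in Equation 5.25 in the
opposite order. An almost identical calculation leads to

$$\frac{1}{2}\left(\frac{x^2}{a^2} - a^2 \frac{\partial^2}{\partial x^2}\right) = \frac{1}{2}\left(\frac{x}{a} + a\frac{\partial}{\partial x}\right)\left(\frac{x}{a} - a\frac{\partial}{\partial x}\right) - \frac{1}{2}, \tag{5.28}$$

so in this case the remainder is $-1/2$.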


These 'factorizations' turn out to be very useful, so we will introduce a notation
that represents them more compactly. We define two new operators:

$$\hat{A}^\dagger = \frac{1}{\sqrt{2}}\left(\frac{x}{a} - a\frac{\partial}{\partial x}\right), \tag{5.29}$$

$$\hat{A} = \frac{1}{\sqrt{2}}\left(\frac{x}{a} + a\frac{\partial}{\partial x}\right). \tag{5.30}$$

For reasons that will emerge shortly, $\hat{A}^\dagger$ is called a raising operator and $\hat{A}$ is
called a lowering operator. It is conventional to place a dagger on the raising
operator and to leave the lowering operator unadorned. Notice that the dagger on
the raising operator (which looks a bit like a plus sign) is accompanied by a minus
sign in the definition of the operator.

Figure 5.11 Paul Dirac (1902-1984) was one of the founders of quantum mechanics.
Dirac won the Nobel prize for physics in 1933.

Using these definitions, together with Equation 5.27, the Hamiltonian operator
can be expressed in the 'factorized' form

$$\hat{H} = \left(\hat{A}^\dagger \hat{A} + \tfrac{1}{2}\right)\hbar\omega_0. \tag{5.31}$$

Remember that $\hat{A}^\dagger$ and $\hat{A}$ are first-order differential operators (Equations 5.29
and 5.30), so we have expressed a second-order differential operator, $\hat{H}$, in terms
of two first-order differential operators. You will soon see why this is useful.


First, we must make a very important point about the ordering of the operators $\hat{A}^\dagger$
and $\hat{A}$. Subtracting Equation 5.27 from Equation 5.28, we obtain

$$\hat{A}\hat{A}^\dagger - \hat{A}^\dagger\hat{A} = 1, \tag{5.32}$$

which, of course, means that

$$\hat{A}\hat{A}^\dagger f(x) \neq \hat{A}^\dagger\hat{A}\, f(x),$$

so the ordering of the two operators obviously matters.

In mathematical terms, the expression on the left-hand side of Equation 5.32
is called the commutator of $\hat{A}$ and $\hat{A}^\dagger$, and the whole equation is called a
commutation relation. The fact that the commutator does not vanish is also
expressed by saying that the two operators do not commute with one another. The
concept of non-commuting operators is familiar in everyday life, where the order
of activities is often vital. Superman puts his underpants on after the rest of his
clothes, while I aim to do the reverse; the effects are not similar! In the present
context, it means that you must take care when writing down Equation 5.31.
The order $\hat{A}^\dagger\hat{A}$ is called the normal ordering of $\hat{A}^\dagger$ and $\hat{A}$. We shall always
use normal ordering when expressing the Hamiltonian operator of a harmonic
oscillator in terms of raising and lowering operators.
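The commutation relation can be checked mechanically on simple test functions. The sketch below is an illustration only: it restricts $f$ to polynomials in the dimensionless variable $\xi = x/a$, stored as coefficient lists $[c_0, c_1, c_2, \ldots]$, and represents $\sqrt{2}\hat{A} = \xi + d/d\xi$ and $\sqrt{2}\hat{A}^\dagger = \xi - d/d\xi$ exactly with rational arithmetic. The helper names are hypothetical, and polynomial test functions are an assumption made for convenience; the text proves the relation for any smooth $f$.

```python
from fractions import Fraction

def derivative(p):
    """d/dxi of the polynomial with coefficients p = [c0, c1, c2, ...]."""
    return [k * c for k, c in enumerate(p)][1:] or [Fraction(0)]

def times_xi(p):
    """Multiply the polynomial by xi (shift every coefficient up one power)."""
    return [Fraction(0)] + list(p)

def combine(p, q, sign):
    """Return p + sign * q, padding the shorter list with zeros."""
    n = max(len(p), len(q))
    p = list(p) + [Fraction(0)] * (n - len(p))
    q = list(q) + [Fraction(0)] * (n - len(q))
    return [u + sign * v for u, v in zip(p, q)]

def lower(p):
    """sqrt(2) * A acting on p:  (xi + d/dxi) p."""
    return combine(times_xi(p), derivative(p), +1)

def raise_(p):
    """sqrt(2) * A-dagger acting on p:  (xi - d/dxi) p."""
    return combine(times_xi(p), derivative(p), -1)

def commutator(p):
    """(A A-dagger - A-dagger A) p; the two factors of sqrt(2) give the 1/2."""
    p = [Fraction(c) for c in p]
    diff = combine(lower(raise_(p)), raise_(lower(p)), -1)
    result = [c / 2 for c in diff]
    while len(result) > 1 and result[-1] == 0:
        result.pop()                      # drop cancelled leading powers
    return result

f = [3, -1, 0, 5]                         # f(xi) = 3 - xi + 5 xi^3
print(commutator(f) == f)
```

The comparison prints `True` when the commutator returns the test function unchanged, which is what Equation 5.32 requires.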


5.3.3 Obtaining the eigenvalues and eigenfunctions 


We shall now present an argument that leads to the energy eigenvalues and 
eigenfunctions of a harmonic oscillator. The argument takes a surprising route, 
originally discovered by Dirac (Figure 5.11). Discovering the route took genius, 
but the individual steps are not difficult to follow. 

The eigenvalues and eigenfunctions are found by solving the time-independent
Schrödinger equation, $\hat{H}\psi = E\psi$. We use Equation 5.31 to express the
Hamiltonian operator $\hat{H}$ in terms of raising and lowering operators. The
time-independent Schrödinger equation then takes the form:

$$\left(\hat{A}^\dagger\hat{A} + \tfrac{1}{2}\right)\hbar\omega_0\, \psi(x) = E\, \psi(x). \tag{5.33}$$


The first rung on the energy ladder 


Rather than trying to find all of the solutions of this equation at once, we begin by
noting that Equation 5.33 would be satisfied if

$$\hat{A}\, \psi(x) = 0 \quad \text{and} \quad E = \tfrac{1}{2}\hbar\omega_0. \tag{5.34}$$

Writing out the equation $\hat{A}\, \psi(x) = 0$ in full, we see that

$$\frac{1}{\sqrt{2}}\left(\frac{x}{a} + a\frac{\partial}{\partial x}\right)\psi(x) = 0.$$

This gives the first-order differential equation

$$\frac{d\psi}{dx} = -\frac{x}{a^2}\, \psi(x), \tag{5.35}$$

which has the solution

$$\psi(x) = C_0\, e^{-x^2/2a^2}, \tag{5.36}$$

where $C_0$ is a constant.

● Verify that $\psi(x)$ in Equation 5.36 does satisfy Equation 5.35.

○ Differentiating $\psi(x) = C_0\, e^{-x^2/2a^2}$ with respect to $x$ gives

$$\frac{d\psi}{dx} = \frac{-2x}{2a^2}\, C_0\, e^{-x^2/2a^2} = -\frac{x}{a^2}\, \psi(x),$$

as required.

Our analysis has shown that $\psi(x) = C_0\, e^{-x^2/2a^2}$ is an energy eigenfunction
(a solution of the time-independent Schrödinger equation) with eigenvalue
$\hbar\omega_0/2$. You may recognize it as the eigenfunction discussed earlier in Worked
Example 5.1. We can be sure that this is the ground-state eigenfunction because
it has no nodes; this is a characteristic property of ground-state energy
eigenfunctions, not shared by those of other states. We therefore denote it by
$\psi_0(x)$. Exercise 5.5 showed that the appropriate normalization constant is
$C_0 = 1/(\sqrt{\pi}\, a)^{1/2}$, so

$$\psi_0(x) = \left(\frac{1}{\sqrt{\pi}\, a}\right)^{1/2} e^{-x^2/2a^2} \tag{5.37}$$

is the normalized ground-state eigenfunction of a harmonic oscillator, and the
corresponding energy eigenvalue is $E_0 = \hbar\omega_0/2$. This is the lowest rung on the
ladder of energy levels.


Climbing the energy level ladder 


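We have found the ground-state eigenfunction and eigenvalue, but what about all
the excited states? Fortunately, the raising operator can be used to find many more
eigenfunctions. (Raising and lowering operators are also called ladder operators.)
To see how this works, consider the function

$$\hat{A}^\dagger \psi_0(x) = \frac{1}{\sqrt{2}}\left(\frac{x}{a} - a\frac{\partial}{\partial x}\right)\psi_0(x).$$

We will show that this is also an eigenfunction of the Hamiltonian, with a higher
eigenvalue than $\psi_0$. To accomplish this, we apply the Hamiltonian operator to
$\hat{A}^\dagger \psi_0$, giving

$$\hat{H}\bigl[\hat{A}^\dagger \psi_0(x)\bigr] = \left(\hat{A}^\dagger\hat{A} + \tfrac{1}{2}\right)\hbar\omega_0 \bigl[\hat{A}^\dagger \psi_0(x)\bigr].$$

Leaving the order of the operators $\hat{A}^\dagger$ and $\hat{A}$ undisturbed, and remembering that $\tfrac{1}{2}$
and $\hbar\omega_0$ are just numbers, we can carefully pull out a factor $\hat{A}^\dagger$ from terms on the
right-hand side to obtain

$$\hat{H}\bigl[\hat{A}^\dagger \psi_0(x)\bigr] = \hat{A}^\dagger\left(\hat{A}\hat{A}^\dagger + \tfrac{1}{2}\right)\hbar\omega_0\, \psi_0(x).$$

Now, the operators $\hat{A}$ and $\hat{A}^\dagger$ in the term contained in round brackets are not in
their normal order. This can be rectified by using the commutation relation
$\hat{A}\hat{A}^\dagger - \hat{A}^\dagger\hat{A} = 1$ (Equation 5.32) to write

$$\hat{A}\hat{A}^\dagger = \hat{A}^\dagger\hat{A} + 1,$$

giving

$$\hat{H}\bigl[\hat{A}^\dagger \psi_0(x)\bigr] = \hat{A}^\dagger\left(\hat{A}^\dagger\hat{A} + 1 + \tfrac{1}{2}\right)\hbar\omega_0\, \psi_0(x) = \hat{A}^\dagger\left(\hat{H} + \hbar\omega_0\right)\psi_0(x).$$

We already know that $\psi_0(x)$ is an eigenfunction of $\hat{H}$ with eigenvalue $\hbar\omega_0/2$, so
we conclude that

$$\hat{H}\bigl[\hat{A}^\dagger \psi_0(x)\bigr] = \hat{A}^\dagger\left(\tfrac{1}{2}\hbar\omega_0 + \hbar\omega_0\right)\psi_0(x) = \tfrac{3}{2}\hbar\omega_0 \bigl[\hat{A}^\dagger \psi_0(x)\bigr],$$

which shows that $\hat{A}^\dagger \psi_0(x)$ is an eigenfunction of the Hamiltonian, with
eigenvalue $3\hbar\omega_0/2$.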


The nice thing about this argument is that it can be generalized. Given any energy
eigenfunction $\psi_n(x)$, with eigenvalue $E_n$, we can form a new function $\hat{A}^\dagger \psi_n(x)$.
Then, repeating the above argument, we find that

$$\hat{H}\bigl[\hat{A}^\dagger \psi_n(x)\bigr] = \hat{A}^\dagger\left(\hat{H} + \hbar\omega_0\right)\psi_n(x) = (E_n + \hbar\omega_0)\bigl[\hat{A}^\dagger \psi_n(x)\bigr], \tag{5.38}$$

which shows that

$\hat{A}^\dagger \psi_n(x)$ is an eigenfunction of $\hat{H}$ with eigenvalue $E_n + \hbar\omega_0$.

Similar arguments apply to the operator $\hat{A}$, but in this case we have

$$\hat{H}\bigl[\hat{A}\, \psi_n(x)\bigr] = (E_n - \hbar\omega_0)\bigl[\hat{A}\, \psi_n(x)\bigr], \tag{5.39}$$

and so

$\hat{A}\, \psi_n(x)$ is an eigenfunction of $\hat{H}$ with eigenvalue $E_n - \hbar\omega_0$.

Exercise 5.8 Prove Equation 5.39, using a method similar to that used for $\hat{A}^\dagger$
but making earlier use of the commutation relation between $\hat{A}$ and $\hat{A}^\dagger$.

It should now be clear why $\hat{A}^\dagger$ and $\hat{A}$ are called raising and lowering operators.
Figure 5.12 summarizes their effects: $\hat{A}^\dagger$ climbs a rung on the ladder of energy
levels, while $\hat{A}$ steps down a rung. The descent stops at $\psi_0(x)$ because we know
that $\hat{A}\, \psi_0(x) = 0$, so the lowering operator generates no new eigenvalues below
$E_0 = \hbar\omega_0/2$.

These discoveries are remarkable because they reveal a never-ending ladder of
energy eigenvalues. Starting from the ground state, with energy $\hbar\omega_0/2$, and
repeatedly applying the raising operator, we can produce an infinite set of energy
levels, $\tfrac{1}{2}\hbar\omega_0$, $\tfrac{3}{2}\hbar\omega_0$, $\tfrac{5}{2}\hbar\omega_0$, ....


Figure 5.12 Climbing up and down the ladder of energy levels.


Can we be sure that this process generates all the energy levels? In fact, we can.
To see why, let us start from an arbitrary energy level $E$, and repeatedly apply the
lowering operator, $\hat{A}$. This generates energy levels $E - \hbar\omega_0$, $E - 2\hbar\omega_0$, .... If
this process continued forever, we would eventually reach energy levels below the
ground state, which is absurd. The only way to avoid this infinite descent is for
the lowering process to produce a function $\psi(x)$ for which $\hat{A}\, \psi(x) = 0$. Then,
further applications of the lowering operator produce no new functions and the
descent ceases. But we have already investigated the equation $\hat{A}\, \psi(x) = 0$ and
seen that its only solution is the ground-state eigenfunction, with eigenvalue
$\tfrac{1}{2}\hbar\omega_0$. Since the process of descending from $E$ in steps of $\hbar\omega_0$ must eventually
reach $\tfrac{1}{2}\hbar\omega_0$, it follows that all the eigenvalues are a whole number of $\hbar\omega_0$ steps
above $\tfrac{1}{2}\hbar\omega_0$. In other words, all the energy levels are given by

$$E_n = \left(n + \tfrac{1}{2}\right)\hbar\omega_0, \quad \text{where } n = 0, 1, 2, \ldots,$$

which confirms Equation 5.16.


Finally, let us briefly consider the energy eigenfunctions. We have shown that 
the raising and lowering operators allow us to move between neighbouring 
eigenfunctions, but nothing in our analysis has shown that these operators take a 
normalized eigenfunction and produce another normalized eigenfunction. In 
fact, it turns out that these operators do not preserve normalization. Detailed 
calculations show that if $\psi_{n-1}(x)$ and $\psi_n(x)$ are normalized energy
eigenfunctions, then

$$\hat{A}^\dagger\, \psi_{n-1}(x) = \sqrt{n}\; \psi_n(x), \tag{5.40}$$

$$\hat{A}\, \psi_n(x) = \sqrt{n}\; \psi_{n-1}(x). \tag{5.41}$$

(Note that, in both cases, the coefficient of the final eigenfunction involves the
higher of the two quantum numbers being linked.)

These equations make sense because they imply that

$$\hat{A}^\dagger\hat{A}\, \psi_n(x) = \hat{A}^\dagger \sqrt{n}\; \psi_{n-1}(x) = \sqrt{n}\sqrt{n}\; \psi_n(x) = n\, \psi_n(x),$$

which is necessary to obtain the correct energy eigenvalues:

$$\hat{H}\, \psi_n(x) = \left(\hat{A}^\dagger\hat{A} + \tfrac{1}{2}\right)\hbar\omega_0\, \psi_n(x) = \left(n + \tfrac{1}{2}\right)\hbar\omega_0\, \psi_n(x).$$

Because the operator $\hat{A}^\dagger\hat{A}$ has the quantum numbers $n = 0, 1, 2, \ldots$ as its
eigenvalues, it is often called the number operator.

To find the normalized eigenfunction $\psi_n(x)$, we rewrite Equation 5.40 in the form

$$\psi_n(x) = \frac{1}{\sqrt{n}}\, \hat{A}^\dagger\, \psi_{n-1}(x), \tag{5.42}$$

and repeatedly apply this result to give

$$\psi_n(x) = \left(\frac{1}{2^n n!}\right)^{1/2} \left(\frac{x}{a} - a\frac{\partial}{\partial x}\right)^n \psi_0(x).$$

Inserting the ground-state eigenfunction (Equation 5.37), we finally obtain

$$\psi_n(x) = \left(\frac{1}{2^n n! \sqrt{\pi}\, a}\right)^{1/2} \left(\frac{x}{a} - a\frac{\partial}{\partial x}\right)^n e^{-x^2/2a^2}. \tag{5.43}$$

The effect of repeatedly applying the operator $(x/a - a\, \partial/\partial x)$ is to leave the
Gaussian term $e^{-x^2/2a^2}$ unchanged, and to bring down increasingly complicated
polynomials in front of it; these are precisely the Hermite polynomials mentioned
in Section 5.2.
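The way the operator in Equation 5.43 "brings down" polynomials can be made concrete. Writing $\xi = x/a$ and applying $(\xi - d/d\xi)$ to $P(\xi)\, e^{-\xi^2/2}$ gives $(2\xi P - P')\, e^{-\xi^2/2}$, so only the polynomial part changes. The sketch below, with hypothetical helper names and polynomials stored as coefficient lists $[c_0, c_1, \ldots]$, iterates this rule starting from $P = 1$. It also applies the number operator, which acts on the polynomial part as $\xi P' - \tfrac{1}{2}P''$ (this follows from Equations 5.29 and 5.30 under the same substitution).

```python
from fractions import Fraction

def raise_poly(p):
    """Polynomial part after applying (xi - d/dxi) to p(xi) * exp(-xi^2 / 2)."""
    two_xi_p = [0] + [2 * c for c in p]
    dp = [k * c for k, c in enumerate(p)][1:]
    dp += [0] * (len(two_xi_p) - len(dp))
    return [u - v for u, v in zip(two_xi_p, dp)]

def hermite_table(n_max):
    """Polynomial parts for n = 0 .. n_max, generated from P = 1."""
    polys = [[1]]
    for _ in range(n_max):
        polys.append(raise_poly(polys[-1]))
    return polys

def number_operator(p):
    """Polynomial part of (A-dagger A) acting on p(xi) * exp(-xi^2 / 2)."""
    dp = [k * c for k, c in enumerate(p)][1:]
    d2p = [k * c for k, c in enumerate(dp)][1:]
    xi_dp = [0] + dp
    d2p += [0] * (len(xi_dp) - len(d2p))
    return [u - Fraction(v, 2) for u, v in zip(xi_dp, d2p)]

for n, poly in enumerate(hermite_table(3)):
    scaled = [n * c for c in poly]
    print(n, poly, number_operator(poly) == scaled)
```

The coefficient lists printed for $n = 0$ to 3 can be compared with the Hermite polynomials of Section 5.2, and the final column confirms that each polynomial part is multiplied by $n$ under the number operator.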


You could use Equation 5.43 to find the first few energy eigenfunctions, and 
compare your answers with Table 5.1. However, this is not really necessary. One 
of the great advantages of raising and lowering operators is that they allow us to 
explore the quantum properties of harmonic oscillators without using any explicit 
formulae for the energy eigenfunctions; this is the subject of the next section. 


5.4 Quantum properties of oscillators 


Raising and lowering operators have an importance beyond the derivation of 
energy eigenvalues and eigenfunctions. In this section, we shall use them to help 
understand the quantum-mechanical behaviour of harmonic oscillators. We 

shall look at the expectation values and uncertainties of various observable 
quantities, including position, momentum, kinetic energy and potential energy, in 
different stationary states. You will also see why the selection rule mentioned in 
Section 5.2.2 applies to harmonic oscillators. 


5.4.1 Expectation values and uncertainties 


If a system is in a stationary state $\Psi_n(x, t) = \psi_n(x)\, e^{-iE_n t/\hbar}$, the expectation
value of any observable $O$ is

$$\langle O \rangle = \int_{-\infty}^{\infty} \Psi_n^*(x, t)\, \hat{O}\, \Psi_n(x, t)\, dx$$

$$= \int_{-\infty}^{\infty} \left(\psi_n(x)\, e^{-iE_n t/\hbar}\right)^* \hat{O} \left(\psi_n(x)\, e^{-iE_n t/\hbar}\right) dx$$

$$= \int_{-\infty}^{\infty} \psi_n^*(x)\, \hat{O}\, \psi_n(x)\, dx,$$

and so depends on the normalized energy eigenfunction $\psi_n(x)$. The eigenfunction
is independent of time, and so is the expectation value.

For a harmonic oscillator, we could evaluate such integrals by inserting the
appropriate eigenfunctions. However, you have seen that these eigenfunctions are
messy combinations of normalization constants, polynomials and Gaussian
functions, so evaluating the integrals would be an arduous task. Fortunately,
raising and lowering operators can save us a great deal of trouble, as we shall now
show.


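The key point is that position and momentum operators can be expressed in terms
of raising and lowering operators. Combining Equations 5.29 and 5.30 and
recalling that $\hat{x} = x$ and $\hat{p}_x = -i\hbar\, \partial/\partial x$, we obtain

$$\hat{x} = \frac{a}{\sqrt{2}}\left(\hat{A} + \hat{A}^\dagger\right), \tag{5.44}$$

$$\hat{p}_x = \frac{-i\hbar}{\sqrt{2}\, a}\left(\hat{A} - \hat{A}^\dagger\right). \tag{5.45}$$

(Note that the denominator of Equation 5.45 is $\sqrt{2}\, a$, not $\sqrt{2a}$.)

Consequently, the expectation value of position can be rewritten as

$$\langle x \rangle = \frac{a}{\sqrt{2}} \int_{-\infty}^{\infty} \psi_n^*(x) \left(\hat{A} + \hat{A}^\dagger\right) \psi_n(x)\, dx. \tag{5.46}$$

The advantage of doing this is that $\hat{A}$ and $\hat{A}^\dagger$ have known, simple effects. They
are lowering and raising operators, so we must have

$$\left(\hat{A} + \hat{A}^\dagger\right)\psi_n(x) = \alpha\, \psi_{n-1}(x) + \beta\, \psi_{n+1}(x), \tag{5.47}$$

where $\alpha$ and $\beta$ are constants. (In fact, $\alpha = \sqrt{n}$ and $\beta = \sqrt{n + 1}$, from
Equations 5.40 and 5.41, but this detail is not needed.) We only need to substitute
Equation 5.47 into Equation 5.46 to obtain

$$\langle x \rangle = \frac{a}{\sqrt{2}} \int_{-\infty}^{\infty} \psi_n^*(x) \left[\alpha\, \psi_{n-1}(x) + \beta\, \psi_{n+1}(x)\right] dx.$$

Then, using the fact that $\psi_n(x)$ is orthogonal to both $\psi_{n-1}(x)$ and $\psi_{n+1}(x)$, we
immediately see that $\langle x \rangle = 0$. An almost identical argument applies to $\langle p_x \rangle$,
which is also equal to zero.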


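Worked Example 5.2

Essential skill: Calculating expectation values using raising and lowering
operators

Show that a harmonic oscillator in a stationary state with quantum number $n$
has

$$\langle x^2 \rangle = \left(n + \tfrac{1}{2}\right) a^2. \tag{5.48}$$

Solution

We begin by using Equation 5.44 to write

$$\langle x^2 \rangle = \frac{a^2}{2} \int_{-\infty}^{\infty} \psi_n^*(x) \left(\hat{A} + \hat{A}^\dagger\right)^2 \psi_n(x)\, dx.$$

It pays to think carefully about the effect of the operators. Taking care to
maintain their ordering, we have

$$\left(\hat{A} + \hat{A}^\dagger\right)^2 = \left(\hat{A} + \hat{A}^\dagger\right)\left(\hat{A} + \hat{A}^\dagger\right) = \hat{A}\hat{A} + \hat{A}^\dagger\hat{A}^\dagger + \hat{A}^\dagger\hat{A} + \hat{A}\hat{A}^\dagger.$$

Consider what each term in this sum does to $\psi_n(x)$. The first two terms
convert $\psi_n(x)$ into different eigenfunctions ($\psi_{n-2}(x)$ and $\psi_{n+2}(x)$).
Because $\psi_n(x)$ is orthogonal to these eigenfunctions, we can safely drop
these terms from the integral, leaving only

$$\langle x^2 \rangle = \frac{a^2}{2} \int_{-\infty}^{\infty} \psi_n^*(x) \left(\hat{A}^\dagger\hat{A} + \hat{A}\hat{A}^\dagger\right) \psi_n(x)\, dx.$$

The surviving terms have equal numbers of raising and lowering operators.
Our tactic now is to express everything in terms of the number operator,
$\hat{A}^\dagger\hat{A}$, because we know that this has a simple effect on $\psi_n(x)$. Using the
commutation relation $\hat{A}\hat{A}^\dagger - \hat{A}^\dagger\hat{A} = 1$, we obtain

$$\hat{A}\hat{A}^\dagger = \hat{A}^\dagger\hat{A} + 1,$$

so

$$\left(\hat{A}^\dagger\hat{A} + \hat{A}\hat{A}^\dagger\right)\psi_n(x) = \left(2\hat{A}^\dagger\hat{A} + 1\right)\psi_n(x) = (2n + 1)\, \psi_n(x),$$

and

$$\langle x^2 \rangle = \frac{a^2}{2}\, (2n + 1) \int_{-\infty}^{\infty} \psi_n^*(x)\, \psi_n(x)\, dx = \left(n + \tfrac{1}{2}\right) a^2,$$

as required.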


Exercise 5.9 Show that a harmonic oscillator in a stationary state with
quantum number $n$ has an expectation value of $p_x^2$ given by

$$\langle p_x^2 \rangle = \left(n + \tfrac{1}{2}\right) \frac{\hbar^2}{a^2}. \tag{5.49}$$


The two results just derived for $\langle x^2 \rangle$ and $\langle p_x^2 \rangle$ allow us to find the expectation
values of potential and kinetic energy in a stationary state with quantum
number $n$. For the potential energy,

$$\langle E_{\text{pot}} \rangle = \left\langle \tfrac{1}{2} m\omega_0^2 x^2 \right\rangle = \tfrac{1}{2} m\omega_0^2 \langle x^2 \rangle = \tfrac{1}{2}\left(n + \tfrac{1}{2}\right) m\omega_0^2 a^2,$$

and for the kinetic energy,

$$\langle E_{\text{kin}} \rangle = \frac{\langle p_x^2 \rangle}{2m} = \tfrac{1}{2}\left(n + \tfrac{1}{2}\right) \frac{\hbar^2}{m a^2}.$$

The ratio of the kinetic and potential energy expectation values is

$$\frac{\langle E_{\text{kin}} \rangle}{\langle E_{\text{pot}} \rangle} = \frac{\hbar^2}{m^2 \omega_0^2 a^4} = 1,$$

where we have used the fact that $a = \sqrt{\hbar/m\omega_0}$. Hence the expectation value of
the kinetic energy is equal to the expectation value of the potential energy. This is
the quantum-mechanical version of the result illustrated in Figure 5.3, where the
classical kinetic and potential energies, averaged over one cycle of oscillation,
were the same.

We can also use our expressions for $\langle x^2 \rangle$ and $\langle p_x^2 \rangle$ to find the uncertainties in
position and momentum. Because $\langle x \rangle$ and $\langle p_x \rangle$ are both equal to zero, we have

$$\Delta x = \sqrt{\langle x^2 \rangle - \langle x \rangle^2} = \sqrt{\langle x^2 \rangle} = \sqrt{n + \tfrac{1}{2}}\; a, \tag{5.50}$$

$$\Delta p_x = \sqrt{\langle p_x^2 \rangle - \langle p_x \rangle^2} = \sqrt{\langle p_x^2 \rangle} = \sqrt{n + \tfrac{1}{2}}\; \frac{\hbar}{a}. \tag{5.51}$$

Multiplying these uncertainties together gives

$$\Delta x\, \Delta p_x = \left(n + \tfrac{1}{2}\right)\hbar. \tag{5.52}$$

Three points are worth noting:

1. Equation 5.52 is consistent with the Heisenberg uncertainty principle, which
requires that $\Delta x\, \Delta p_x \geq \hbar/2$ in any state.

2. The ground state of a harmonic oscillator ($n = 0$) has the minimum possible
uncertainty product, $\Delta x\, \Delta p_x = \hbar/2$.

3. Highly-excited states (with $n \gg 1$) have uncertainty products that are much
larger than $\hbar/2$, but this is fully consistent with the uncertainty principle,
which is only an inequality.
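The expectation values and uncertainty products above can be reproduced with small matrices. The sketch below is an illustration, not the method of the text: it represents $\hat{A}$ and $\hat{A}^\dagger$ by their matrix elements in a truncated basis $\psi_0, \ldots, \psi_{N-1}$, taken from Equations 5.40 and 5.41, and works in assumed units where $a = 1$ and $\hbar = 1$. The truncation size ($n + 3$) and the function names are choices made here; the basis only needs to contain the level above $n$.

```python
import math

def ladder_matrices(size):
    """Matrices of A and A-dagger; entry [r][c] is the psi_r component of Op psi_c."""
    lower = [[0.0] * size for _ in range(size)]
    for n in range(1, size):
        lower[n - 1][n] = math.sqrt(n)            # A psi_n = sqrt(n) psi_{n-1}
    upper = [[lower[c][r] for c in range(size)] for r in range(size)]
    return lower, upper

def matmul(p, q):
    """Plain matrix product of two square lists-of-lists."""
    size = len(p)
    return [[sum(p[r][k] * q[k][c] for k in range(size)) for c in range(size)]
            for r in range(size)]

def moments(n):
    """Return <x>/a, <x^2>/a^2 and <p^2> a^2/hbar^2 in the stationary state n."""
    size = n + 3                                  # must include level n + 1
    lower, upper = ladder_matrices(size)
    plus = [[lower[r][c] + upper[r][c] for c in range(size)] for r in range(size)]
    minus = [[lower[r][c] - upper[r][c] for c in range(size)] for r in range(size)]
    x_mean = plus[n][n] / math.sqrt(2)            # from Equation 5.44
    x2 = 0.5 * matmul(plus, plus)[n][n]           # (a^2/2) (A + A-dagger)^2
    p2 = -0.5 * matmul(minus, minus)[n][n]        # -(hbar^2/2a^2) (A - A-dagger)^2
    return x_mean, x2, p2

for n in range(4):
    x_mean, x2, p2 = moments(n)
    print(f"n = {n}: <x> = {x_mean:.1f}, <x^2> = {x2:.2f}, <p^2> = {p2:.2f}, "
          f"dx*dp = {math.sqrt(x2 * p2):.2f}")
```

In these units both $\langle x^2 \rangle$ and $\langle p_x^2 \rangle$ should equal $n + \tfrac{1}{2}$, so the kinetic and potential energy expectation values agree and the uncertainty product is $(n + \tfrac{1}{2})\hbar$.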


5.4.2 A selection rule 


Finally, we can use raising and lowering operators to gain an insight into the 
selection rule mentioned in Section 5.2.2. To take a definite case, we consider a 
vibrating diatomic molecule, such as HCl. In this molecule, one of the atoms (Cl) 
carries a slightly negative charge, while the other (H) carries a slightly positive 
charge, so the molecule can be thought of as an oscillating electric dipole. In 
classical physics, we would explain the absorption or emission of light from the 
molecule by treating it like a tiny radio aerial. 


A very different picture applies in quantum physics, where the molecule has a set
of equally-spaced vibrational energy levels. Neighbouring levels are separated by
$\hbar\omega_0$, where $\omega_0$ is the classical angular frequency of vibration. If there were no
restrictions on transitions between the energy levels, a highly-excited molecule
would emit photons of energies $\hbar\omega_0$, $2\hbar\omega_0$, $3\hbar\omega_0$, and so on. But this is not what
is observed.


If we ignore extremely faint spectral lines, we only get photons of energy wp, 
corresponding to transitions between neighbouring levels. A transition that is 
accompanied by the absorption or emission of a photon is generally called 


a radiative transition, so we can also say that the radiative transitions in a Radiative transitions between 
harmonic oscillator link neighbouring levels. This is the selection rule we vibrational levels of CO» are a 
mentioned earlier. Where does it come from? cause for serious concern; via 


the greenhouse effect, they are 
responsible for the largest 
man-made contribution to global 
warming. 


At the end of the course, we will describe the quantum theory of light interacting 
with matter, and you will see that, under most conditions, the rate of transitions 
from one state to another obeys the rule 

$$\text{rate of transitions} \propto \left| \int_{-\infty}^{\infty} \psi_m^*(x)\, x\, \psi_n(x)\, dx \right|^2, \tag{5.53}$$

where $\psi_m(x)$ and $\psi_n(x)$ are the energy eigenfunctions of the states involved. We 
cannot justify this formula here but, taking it on trust, we can trace the origins of 
the selection rule. 


Figure 5.13 A single electron trapped in a potential energy well where its 
Hamiltonian is that of a harmonic oscillator. The electron performs quantum 
jumps between neighbouring energy levels. (Horizontal axis: time/s.) 


Exercise 5.10 Show that the integral in Equation 5.53 is equal to zero unless 
$n = m \pm 1$. 


This establishes the selection rule: in a harmonic oscillator, radiative transitions 
link neighbouring energy levels. It is possible to observe this selection rule in a 
very direct way. Most experiments in quantum mechanics involve data from a vast 
number of atoms or molecules, but it has recently become possible to observe the 
behaviour of single particles. Figure 5.13 shows the results of an experiment that 
continuously monitors the energy of an electron trapped in a harmonic potential 
energy well. It is observed that the electron always jumps from one energy level 
to the next, never changing its quantum number by more than one. If you have 
ever doubted the reality of quantum jumps between discrete states, here is firm 
evidence! 
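
The selection rule can also be checked numerically. The Python sketch below is an illustration rather than part of the course text. It works in units where the length parameter $a = 1$, builds the normalized eigenfunctions from the standard Hermite recurrence, and estimates the integral in Equation 5.53 with the trapezoidal rule. The function names (`psi`, `dipole_integral`), the integration range and the step count are assumptions chosen for this example.

```python
import math


def psi(n, y):
    """Normalized oscillator eigenfunction psi_n, in units where a = 1 (y = x/a)."""
    h_prev, h = 0.0, 1.0  # placeholder for H_{-1}, and H_0 = 1
    for k in range(n):
        # Hermite recurrence: H_{k+1}(y) = 2 y H_k(y) - 2 k H_{k-1}(y)
        h_prev, h = h, 2.0 * y * h - 2.0 * k * h_prev
    norm = 1.0 / math.sqrt(2.0 ** n * math.factorial(n) * math.sqrt(math.pi))
    return norm * h * math.exp(-0.5 * y * y)


def dipole_integral(m, n, y_max=10.0, steps=4000):
    """Trapezoidal estimate of the integral of psi_m(y) * y * psi_n(y) dy."""
    h = 2.0 * y_max / steps
    total = 0.0
    for i in range(steps + 1):
        y = -y_max + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * psi(m, y) * y * psi(n, y)
    return total * h


if __name__ == "__main__":
    # Print the integral for the lowest few pairs of levels.
    for m in range(4):
        print([round(dipole_integral(m, n), 6) for n in range(4)])
```

In the printed table, only the entries with $n = m \pm 1$ should differ from zero (to within rounding), which is the pattern the selection rule describes.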


Summary of Chapter 5 


Section 5.1 Classical simple harmonic motion involves sinusoidal oscillations 
in response to a restoring force that is proportional to the displacement of a 
particle from an equilibrium position. The angular frequency of the oscillation is 
$\omega_0 = \sqrt{C/m}$, where $C$ is the force constant and $m$ is the mass of the particle. 


The internal motion of a two-particle system, composed of particles of masses $m_1$ 
and $m_2$, can be analyzed by considering the motion of a single particle with the 
reduced mass $\mu = m_1 m_2/(m_1 + m_2)$. 


Section 5.2 The time-independent Schrödinger equation for a harmonic 
oscillator takes the form 

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + \tfrac{1}{2}Cx^2\,\psi(x) = E\,\psi(x).$$

The energy eigenvalues form an evenly-spaced ladder 

$$E_n = \left(n + \tfrac{1}{2}\right)\hbar\omega_0 \quad \text{for } n = 0, 1, 2, \ldots,$$

but transitions that involve the absorption or emission of photons are effectively 
restricted to those between neighbouring energy levels. 
The corresponding energy eigenfunctions take the form 

$$\psi_n(x) = C_n H_n(x/a)\, e^{-x^2/2a^2},$$

where $C_n$ is a normalization constant, $H_n(x/a)$ is an $n$th order Hermite 
polynomial in $x/a$, and $e^{-x^2/2a^2}$ is a Gaussian function. The constant 
$a = \sqrt{\hbar/m\omega_0}$ is the length parameter that characterizes the oscillator. 


The energy eigenfunction $\psi_n(x)$ has $n$ nodes; it is even when $n$ is even, and odd 
when $n$ is odd. The energy eigenfunctions form an orthonormal set. 


Section 5.3 The Hamiltonian operator for a harmonic oscillator can be 
expressed as 

$$\hat{H} = \left(\hat{A}^\dagger \hat{A} + \tfrac{1}{2}\right)\hbar\omega_0,$$

where $\hat{A}^\dagger$ and $\hat{A}$ are raising and lowering operators. These operators convert any 
energy eigenfunction into a neighbouring eigenfunction. If the eigenfunctions are 
normalized, then 

$$\hat{A}\,\psi_n(x) = \sqrt{n}\,\psi_{n-1}(x) \quad\text{and}\quad \hat{A}^\dagger\psi_{n-1}(x) = \sqrt{n}\,\psi_n(x).$$

By using this representation, and repeatedly applying $\hat{A}^\dagger$ to the ground state, it is 
possible to generate all the eigenfunctions and eigenvalues. 


Section 5.4 Expectation values and uncertainties of $x$ and $p_x$ are conveniently 
calculated by expressing $\hat{x}$ and $\hat{p}_x$ in terms of the raising and lowering operators, 
and using the commutation relation $\hat{A}\hat{A}^\dagger - \hat{A}^\dagger\hat{A} = 1$. The selection rule for 
radiative transitions in vibrating molecules can also be derived using raising and 
lowering operators. 


Achievements from Chapter 5 


After studying this chapter you should be able to: 
5.1 Explain the meanings of the newly defined (emboldened) terms and 
symbols, and use them appropriately. 


5.2 Compare and contrast the quantum and classical behaviour of harmonic 
oscillators. 


5.3 Write down the Schrödinger equation and the time-independent Schrödinger 
equation for a harmonic oscillator. 


5.4 Recall the energy eigenvalues of a harmonic oscillator, and describe, in 
general terms, the properties of the energy eigenfunctions. 


5.5 Verify that given functions are energy eigenfunctions of a harmonic 
oscillator, and determine their eigenvalues. 


5.6 Show that if $\psi_n(x)$ is an energy eigenfunction with eigenvalue $E_n$, then 
$\hat{A}^\dagger\psi_n(x)$ is another energy eigenfunction, with eigenvalue $E_n + \hbar\omega_0$. 


5.7 Calculate expectation values and uncertainties using raising and lowering 
operators. 


5.8 Describe how the vibrations of a diatomic molecule can be modelled 
quantum-mechanically. 
Chapter 6 Wave packets and motion 


There are two multimedia sequences associated with this chapter: one supports 
your study of wave packets in wells, and the other explores free-particle wave 
packets. The first sequence can be studied at the end of Section 6.3, or they can 
both be studied at the end of the chapter. 


Introduction 


In the real world, electrons, cannonballs and planets move, molecules and 
atomic nuclei vibrate and jellies wobble. We claim that quantum mechanics is a 
comprehensive theory, with Newtonian mechanics applying as a special case for 
relatively large and heavy objects, yet so far we have only studied the frozen 
world of stationary states. 


Schrödinger’s equation describes how quantum systems develop in time. 
However, soon after introducing this equation, we focused attention on a special 
type of solution known as a stationary state. Each stationary state corresponds to 
a definite energy, but the time-dependence of a stationary state is exceedingly dull. 
In a stationary state, the probability density for position is independent of time; 
whatever happened to the motion that Schrödinger’s equation was supposed to 
describe? 


To describe motion in quantum mechanics, we need to consider solutions of 
Schrödinger’s equation known as wave packets, each wave packet being a linear 
combination of different stationary states. These solutions are not themselves 
stationary, and you will see that they can describe motion. 


Wave packets have other uses as well. By comparing the motion of wave packets 
with the motion of classical particles, we can gain some insight into the 
relationship between quantum physics and classical physics. We seem to have two 
worlds: an everyday world and an utterly different quantum world. We feel that 
there must be some correspondence, some smooth transition, between these two 
realms, but what is its nature? Wave packets help to answer this question. Finally, 
wave packets provide the key to solving the problem of the continuum. You may 
recall that earlier chapters concentrated on bound stationary states because the 
unbound stationary states in the continuum (corresponding, for example, to 
the ionized states of atoms) cannot be normalized and so cannot represent 
any physically realizable state. You will see that the state of a particle in the 
continuum is best described by a wave packet. 


The chapter is organized as follows. First we compare stationary states and 
wave packets, looking at examples of wave packets in a harmonic oscillator 
(Section 6.1). In such an oscillator, you will see that the expectation values of 
position and momentum depend sinusoidally on time, just like the position 
and momentum variables of a classical oscillator. This is part of a general 
result, known as Ehrenfest’s theorem, which provides an important link between 
quantum mechanics and classical mechanics. Ehrenfest’s theorem is discussed in 
Section 6.2. Finally, we consider the problem of predicting motion. Given an 
initial wave packet, you will see how the time-evolution of the wave packet can be 
predicted. Section 6.3 considers wave packets in a simple harmonic oscillator and 
Section 6.4 considers wave packets that describe free particles; you will see how 
these wave packets spread with time. 


6.1 Time-dependence in the quantum world 


There is nothing in the everyday world corresponding to the stationary states 
studied in previous chapters. Electrons in an old-fashioned TV tube are emitted by 
a cathode and accelerated towards a screen, where they give up their energy with 
the emission of light: they move! So do the ions flowing through your brain, 
giving rise to thoughts and emotions. If quantum mechanics is a comprehensive 
theory, it must be capable of describing moving particles. This section will 
confirm that stationary states cannot describe motion, before introducing wave 
packets, which do provide the appropriate description of motion in quantum 
mechanics. 


6.1.1 The frozen world of stationary states 


A stationary state is a solution of Schrödinger’s equation in which the wave 
function is a product of a function of position and a function of time: in one 
dimension, $\Psi(x,t) = \psi(x)T(t)$. In Chapter 2 we substituted this product 
function into Schrödinger’s equation and used the method of separation of 
variables to show that $\psi(x)$ satisfies the time-independent Schrödinger equation 
at a fixed energy $E$, while $T(t)$ is a phase factor of the form $T(t) = e^{-iEt/\hbar}$. It 
follows that any stationary-state wave function can be written as 

$$\Psi(x,t) = \psi(x)\, e^{-iEt/\hbar}. \tag{6.1}$$

The energy $E$ is interpreted as the energy of the system in this state. This means 
that the state described by Equation 6.1 has the definite energy $E$; every 
measurement of energy in this stationary state yields the value $E$. 


For present purposes, the most important point about stationary states is their 
simple time-dependence. The wave function of a stationary state does depend on 
time, but only via a phase factor $e^{-iEt/\hbar}$. Now, it is a general principle of quantum 
mechanics that an overall phase factor, multiplying the whole wave function, has 
no physical significance, so we should not expect the physical properties of a 
stationary state to depend on time. 


To illustrate this point, consider the probability of finding the particle in a small 
interval $\delta x$, centred on the point $x$, in the stationary state described by 
Equation 6.1. As you have seen before, we can use Born’s rule to obtain 

$$\begin{aligned}
\text{probability} &= \Psi^*(x,t)\,\Psi(x,t)\,\delta x \\
&= \psi^*(x)e^{+iEt/\hbar} \times \psi(x)e^{-iEt/\hbar}\,\delta x \\
&= \psi^*(x)\,\psi(x)\,\delta x,
\end{aligned}$$


which is independent of time. In this sense, we can say that the stationary state 
does not describe motion at all. 


Stationary states are static in other respects too. In a stationary state, the 
expectation value of an observable $O$ is given by the sandwich integral 

$$\begin{aligned}
\langle O \rangle &= \int_{-\infty}^{\infty} \Psi^*(x,t)\,\hat{O}\,\Psi(x,t)\,dx \\
&= \int_{-\infty}^{\infty} \psi^*(x)e^{+iEt/\hbar}\,\hat{O}\left(\psi(x)e^{-iEt/\hbar}\right)dx,
\end{aligned}$$

where $\hat{O}$ is the operator corresponding to $O$. 


This follows from the coefficient rule introduced in Section 4.2. 


In general, the operator $\hat{O}$ does not depend on time, so it does not affect the 
time-dependent phase factors, which can be taken outside the integral. We 
therefore have 

$$\begin{aligned}
\langle O \rangle &= e^{+iEt/\hbar}e^{-iEt/\hbar} \int_{-\infty}^{\infty} \psi^*(x)\,\hat{O}\,\psi(x)\,dx \\
&= \int_{-\infty}^{\infty} \psi^*(x)\,\hat{O}\,\psi(x)\,dx,
\end{aligned}$$

which is independent of time. So, the expectation value of any observable 
(position, momentum, kinetic energy, or whatever) remains constant in time in 
any stationary state. To describe motion, and the obvious fact that the expectation 
values of observable quantities can vary in time, we must look beyond stationary 
states. 


6.1.2 The dynamic world of wave packets 


Schrödinger’s equation is linear, so if $\Psi_1(x,t), \Psi_2(x,t), \ldots, \Psi_n(x,t)$ are all 
solutions of Schrödinger’s equation for a given system, and $a_1, a_2, \ldots, a_n$ are 
complex constants, then the linear combination 

$$\Psi(x,t) = a_1\Psi_1(x,t) + a_2\Psi_2(x,t) + \ldots + a_n\Psi_n(x,t)$$

is also a solution of Schrödinger’s equation for the system. This is the principle of 
superposition: a linear combination of stationary-state wave functions satisfies 
Schrödinger’s equation. A normalized linear combination of two or more 
stationary-state wave functions is called a wave packet. 


For simplicity, we will begin by considering a wave packet of just two stationary 
states: 

$$\Psi(x,t) = a_1\psi_1(x)\,e^{-iE_1t/\hbar} + a_2\psi_2(x)\,e^{-iE_2t/\hbar}, \tag{6.2}$$

where $E_2 \neq E_1$. This wave function cannot be written as a product of a function 
of position and a function of time, so it is not a stationary state. If the energy of 
the system is measured in the state described by Equation 6.2, one or other of the 
values $E_1$ or $E_2$ will be obtained, and the energy of the system is indefinite to this 
extent. However, we can say that the probability of obtaining energy $E_i$ is 

$$p_i = \left|a_i e^{-iE_it/\hbar}\right|^2 = |a_i|^2.$$

The coefficients $a_i$ may therefore be called probability amplitudes for energy. The 
fact that an energy measurement must yield either $E_1$ or $E_2$ leads to the condition 
that 

$$|a_1|^2 + |a_2|^2 = 1. \tag{6.3}$$

You will see shortly that Equation 6.3 ensures that $\Psi(x,t)$ is normalized. 


We now come to a central idea in this chapter. For the wave packet in 
Equation 6.2, the probability density for position is 

$$\begin{aligned}
\Psi^*(x,t)\Psi(x,t) = {} & \left[a_1^*\psi_1^*(x)e^{+iE_1t/\hbar} + a_2^*\psi_2^*(x)e^{+iE_2t/\hbar}\right] \\
& \times \left[a_1\psi_1(x)e^{-iE_1t/\hbar} + a_2\psi_2(x)e^{-iE_2t/\hbar}\right].
\end{aligned}$$

Multiplying out the brackets and using the fact that $e^A e^B = e^{A+B}$, we obtain 

$$\begin{aligned}
\Psi^*(x,t)\Psi(x,t) = {} & |a_1|^2\psi_1^*(x)\psi_1(x) + |a_2|^2\psi_2^*(x)\psi_2(x) \\
& + a_1^*a_2\,\psi_1^*(x)\psi_2(x)\,e^{-i(E_2-E_1)t/\hbar} \\
& + a_2^*a_1\,\psi_2^*(x)\psi_1(x)\,e^{+i(E_2-E_1)t/\hbar},
\end{aligned} \tag{6.4}$$

where the first two terms come from multiplying corresponding terms in the 
two brackets, and the second two terms are cross-product terms, obtained by 
multiplying the first term in one bracket by the second term in the other 
bracket. These cross-product terms are important in our story because they are 
time-dependent; they ensure that the probability density depends on time, and so 
account for motion in wave mechanics. Motion is restored! 
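
As a small numerical illustration of Equation 6.4, the Python sketch below evaluates the probability density at a single point in two ways: directly from the wave packet of Equation 6.2, and term by term from Equation 6.4. It is only a sketch under stated assumptions: units with $\hbar = 1$, and arbitrary illustrative values for the coefficients, the eigenfunction values at the chosen point and the two energies. The function names are hypothetical.

```python
import cmath
import math

HBAR = 1.0  # assumption: units in which hbar = 1


def density_direct(a1, a2, psi1, psi2, E1, E2, t):
    """|Psi|^2 at one point, from Equation 6.2 evaluated directly."""
    Psi = (a1 * psi1 * cmath.exp(-1j * E1 * t / HBAR)
           + a2 * psi2 * cmath.exp(-1j * E2 * t / HBAR))
    return abs(Psi) ** 2


def density_eq_6_4(a1, a2, psi1, psi2, E1, E2, t):
    """The same quantity, assembled term by term as in Equation 6.4."""
    a1, a2, psi1, psi2 = (complex(z) for z in (a1, a2, psi1, psi2))
    static = abs(a1) ** 2 * abs(psi1) ** 2 + abs(a2) ** 2 * abs(psi2) ** 2
    cross = (a1.conjugate() * a2 * psi1.conjugate() * psi2
             * cmath.exp(-1j * (E2 - E1) * t / HBAR))
    # The two cross-product terms are complex conjugates of one another.
    return static + (cross + cross.conjugate()).real


if __name__ == "__main__":
    # Illustrative values only: |a1|^2 + |a2|^2 = 0.36 + 0.64 = 1.
    args = (0.6, 0.8j, 0.9, -0.4, 0.5, 1.5)
    for t in (0.0, 1.0, 2.0):
        print(t, density_direct(*args, t), density_eq_6_4(*args, t))
```

The two columns should agree at every time, and the density repeats with period $2\pi\hbar/(E_2 - E_1)$, which is the time-dependence carried by the cross-product terms.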


Exercise 6.1 Show that the linear combination of stationary-state wave 
functions in Equation 6.2 is normalized, with $\int_{-\infty}^{\infty} \Psi^*(x,t)\Psi(x,t)\,dx = 1$, 
provided that the coefficients $a_1$ and $a_2$ satisfy Equation 6.3. 


6.1.3 A wave packet in a harmonic well 


To take a definite example, we will now look more closely at a wave packet in a 
specific system, the harmonic oscillator. Our system consists of a particle of mass 
$m$ that is subject to a potential energy function $V(x) = \tfrac{1}{2}Cx^2 = \tfrac{1}{2}m\omega_0^2x^2$, where 
$\omega_0 = \sqrt{C/m}$ is the classical angular frequency corresponding to the classical 
period of oscillation, $T = 2\pi/\omega_0$. 


In quantum mechanics, the energy levels of the harmonic oscillator are given by 
$E_n = \left(n + \tfrac{1}{2}\right)\hbar\omega_0$, where $n = 0, 1, 2, \ldots$ and the first two energy eigenfunctions 
are 

$$\psi_0(x) = \left(\frac{1}{\sqrt{\pi}\,a}\right)^{1/2} e^{-x^2/2a^2} \quad\text{and} \tag{6.5}$$

$$\psi_1(x) = \left(\frac{1}{2\sqrt{\pi}\,a}\right)^{1/2} \frac{2x}{a}\, e^{-x^2/2a^2}, \tag{6.6}$$

where $a = \sqrt{\hbar/m\omega_0}$ is the characteristic length parameter of the oscillator. 


We shall look at a specific wave packet with equal coefficients for the first and 
second stationary-state wave functions: 

$$\Psi_A(x,t) = \frac{1}{\sqrt{2}}\left[\psi_0(x)\,e^{-i\omega_0t/2} + \psi_1(x)\,e^{-3i\omega_0t/2}\right]. \tag{6.7}$$

See Table 5.1 on page 134. 

Remember that $E_0 = \tfrac{1}{2}\hbar\omega_0$ and $E_1 = \tfrac{3}{2}\hbar\omega_0$. 


Consider what happens at some particular point $x$. Since the two components 
have different time factors, there are times when they have the same sign at $x$, and 
other times when they have opposite signs. In effect, they sometimes add together 
and sometimes subtract, leading to a sum that seems to ‘slosh’ from side to side 
in the well, as shown in Figure 6.1. This figure represents $\Psi_A(x,t)$ at two 
convenient times, $t = 0$ in the top panel, and $t = T/2$ in the lower panel, where $T$ 
is the classical period of the oscillation. The real and imaginary parts of the wave 
function are shown, together with the probability density $|\Psi_A(x,t)|^2$. 
Figure 6.1 $|\Psi_A(x,t)|^2$ and the real and imaginary parts of $\Psi_A(x,t)$ for $t = 0$ and 
for $t = T/2$, where $T$ is the classical period, $2\pi/\omega_0$. Note that the imaginary part 
is zero at $t = 0$ and the real part is zero at $t = T/2$. The graphs are plotted as a 
function of $x/a$, where $a$ is the characteristic length parameter of the harmonic 
oscillator. 

At time $t = 0$, both the time-dependent phase factors in Equation 6.7 are equal to 
1, so the wave function is 

$$\Psi_A(x,0) = \frac{1}{\sqrt{2}}\left[\psi_0(x) + \psi_1(x)\right]. \tag{6.8}$$

At time $t = T/2 = \pi/\omega_0$, the phase factors are 

$$e^{-i\omega_0t/2} = e^{-i\pi/2} = -i \quad\text{and}\quad e^{-3i\omega_0t/2} = e^{-3i\pi/2} = +i,$$

so the wave function is 

$$\Psi_A(x,T/2) = \frac{i}{\sqrt{2}}\left[-\psi_0(x) + \psi_1(x)\right]. \tag{6.9}$$

Note the overall factor of $i$ in the second equation; this makes no contribution to 
the probability density $|\Psi_A(x,T/2)|^2$. A more significant fact is that in 
Equation 6.8 the $\psi_0(x)$ and $\psi_1(x)$ terms are added, but in Equation 6.9 they are 
subtracted. The result is the ‘sloshing’ from side to side shown in Figure 6.1. You 
can see that the probability of finding the particle on the right-hand side of the well 
is large at $t = 0$ and the probability of finding the particle on the left-hand side is 
much smaller. Half a classical period later, exactly the reverse is true: the particle 
is much more likely to be found on the left-hand side of the well. The probability 
density $|\Psi_A|^2$ oscillates periodically, and is no longer independent of time as it is 
for stationary states. Wave packets restore motion to quantum mechanics. 
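
The sloshing can be quantified with a short calculation. The Python sketch below, offered as an illustration only, integrates $|\Psi_A(x,t)|^2$ over the right-hand half of the well ($x > 0$) using the trapezoidal rule. It assumes units where $a = 1$ and takes the phase $\omega_0 t$ as its time variable; the function names, the cutoff `y_max` and the step count are assumptions made for this example.

```python
import cmath
import math


def psi0(y):
    """Ground-state eigenfunction (Equation 6.5) with a = 1."""
    return math.pi ** -0.25 * math.exp(-0.5 * y * y)


def psi1(y):
    """First excited eigenfunction (Equation 6.6) with a = 1."""
    return math.pi ** -0.25 * math.sqrt(2.0) * y * math.exp(-0.5 * y * y)


def Psi_A(y, phase):
    """Wave packet of Equation 6.7; phase stands for omega_0 * t."""
    return (psi0(y) * cmath.exp(-0.5j * phase)
            + psi1(y) * cmath.exp(-1.5j * phase)) / math.sqrt(2.0)


def prob_right(phase, y_max=10.0, steps=4000):
    """Trapezoidal estimate of the probability of finding the particle at x > 0."""
    h = y_max / steps
    total = 0.0
    for i in range(steps + 1):
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * abs(Psi_A(i * h, phase)) ** 2
    return total * h


if __name__ == "__main__":
    print("t = 0:   ", prob_right(0.0))
    print("t = T/2: ", prob_right(math.pi))
```

Integrating the cross term analytically gives a right-hand probability of $\tfrac{1}{2} + \cos(\omega_0 t)/\sqrt{2\pi}$, so the numerical values at $t = 0$ and $t = T/2$ should add up to 1 and swap roles, as described in the text.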


Exercise 6.2 Evaluate $\Psi_A(x,T)$ in the same form as Equations 6.8 and 6.9. 
Verify that the sign of $\Psi_A$ is reversed after one period. Also show that the sign is 
restored after two complete periods. 


Exercise 6.3 Use Equations 6.5 and 6.6 to derive explicit expressions for 
$\Psi_A(x,0)$ and $\Psi_A(x,T/2)$. Give a brief qualitative explanation as to why these 
expressions are consistent with what you see in Figure 6.1. 


Expectation values 


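Figure 6.1 suggests that the time-dependence of the wave packet has at least some 
vestiges of the oscillatory behaviour of, say, a mass on the end of a spring. 
Although the probability density $|\Psi_A(x,t)|^2$ contains complete information about 
this motion, it is more convenient to consider a single quantity, the expectation 
value of position, and see how this depends on time. For the given wave packet, 
the expectation value of position is given by 

$$\langle x \rangle = \int_{-\infty}^{\infty} \Psi_A^*(x,t)\,\hat{x}\,\Psi_A(x,t)\,dx. \tag{6.10}$$

Using 

$$\Psi_A^*(x,t) = \frac{1}{\sqrt{2}}\left[\psi_0^*(x)\,e^{+i\omega_0t/2} + \psi_1^*(x)\,e^{+3i\omega_0t/2}\right]$$

$$\Psi_A(x,t) = \frac{1}{\sqrt{2}}\left[\psi_0(x)\,e^{-i\omega_0t/2} + \psi_1(x)\,e^{-3i\omega_0t/2}\right]$$

and combining phase factors, we see that 

$$\begin{aligned}
\langle x \rangle = {} & \tfrac{1}{2}\int_{-\infty}^{\infty} \psi_0^*(x)\,\hat{x}\,\psi_0(x)\,dx + \tfrac{1}{2}\int_{-\infty}^{\infty} \psi_1^*(x)\,\hat{x}\,\psi_1(x)\,dx \\
& + \tfrac{1}{2}e^{-i\omega_0t}\int_{-\infty}^{\infty} \psi_0^*(x)\,\hat{x}\,\psi_1(x)\,dx + \tfrac{1}{2}e^{+i\omega_0t}\int_{-\infty}^{\infty} \psi_1^*(x)\,\hat{x}\,\psi_0(x)\,dx.
\end{aligned}$$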


Fortunately, the remaining integrals can be evaluated without too much effort. The 
trick is to remember, from Chapter 5, that the position operator in a simple 
harmonic oscillator can be expressed as 

$$\hat{x} = \frac{a}{\sqrt{2}}\left(\hat{A} + \hat{A}^\dagger\right), \tag{6.11}$$

where $\hat{A}$ and $\hat{A}^\dagger$ are lowering and raising operators. The lowering operator 
converts $\psi_n$ into a multiple of $\psi_{n-1}$ (with $\hat{A}\psi_0 = 0$ as a special case) while the 
raising operator converts $\psi_n$ into a multiple of $\psi_{n+1}$. The precise relationships for 
normalized energy eigenfunctions are: 

$$\hat{A}\,\psi_n(x) = \sqrt{n}\,\psi_{n-1}(x) \tag{6.12}$$

$$\hat{A}^\dagger\psi_n(x) = \sqrt{n+1}\,\psi_{n+1}(x). \tag{6.13}$$

The functions $\psi_0(x)$ and $\psi_1(x)$ happen to be real in this case, 
but we retain the stars because complex conjugation is needed 
in general; this would matter if the eigenfunctions were complex. 


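When we substitute Equation 6.11 into our expression for the expectation value 
of position, we get various integrals involving $\hat{A} + \hat{A}^\dagger$. Before carrying out 
the substitution, we shall evaluate the integrals that will be needed. Using 
Equations 6.12 and 6.13, together with the orthonormality of the energy 
eigenfunctions, we obtain 

$$\begin{aligned}
\int_{-\infty}^{\infty} \psi_0^*(x)\left(\hat{A} + \hat{A}^\dagger\right)\psi_0(x)\,dx &= \int_{-\infty}^{\infty} \psi_0^*(x)\,\psi_1(x)\,dx = 0 \\
\int_{-\infty}^{\infty} \psi_1^*(x)\left(\hat{A} + \hat{A}^\dagger\right)\psi_1(x)\,dx &= \int_{-\infty}^{\infty} \psi_1^*(x)\left[\psi_0(x) + \sqrt{2}\,\psi_2(x)\right]dx = 0 \\
\int_{-\infty}^{\infty} \psi_0^*(x)\left(\hat{A} + \hat{A}^\dagger\right)\psi_1(x)\,dx &= \int_{-\infty}^{\infty} \psi_0^*(x)\left[\psi_0(x) + \sqrt{2}\,\psi_2(x)\right]dx = 1 \\
\int_{-\infty}^{\infty} \psi_1^*(x)\left(\hat{A} + \hat{A}^\dagger\right)\psi_0(x)\,dx &= \int_{-\infty}^{\infty} \psi_1^*(x)\,\psi_1(x)\,dx = 1.
\end{aligned}$$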


Finally, substituting Equation 6.11 into our expression for $\langle x \rangle$ and using these 
results, we conclude that 

$$\langle x \rangle = \frac{a}{\sqrt{2}}\left[\tfrac{1}{2}e^{-i\omega_0t} + \tfrac{1}{2}e^{+i\omega_0t}\right] = \frac{a}{\sqrt{2}}\cos(\omega_0t). \tag{6.14}$$

● Express this last equation in words. 
○ For the given wave packet, the expectation value of $x$ undergoes sinusoidal 
motion with amplitude $a/\sqrt{2}$ and with the angular frequency $\omega_0$ of the 
corresponding classical harmonic oscillator. 


A similar calculation can be carried out for the expectation value of the 
momentum of a particle in a state described by the wave packet $\Psi_A(x,t)$. As 
explained in Chapter 5, the momentum operator can also be expressed in terms of 
raising and lowering operators of the harmonic oscillator: 

$$\hat{p}_x = \frac{-i\hbar}{\sqrt{2}\,a}\left(\hat{A} - \hat{A}^\dagger\right). \tag{6.15}$$

We can therefore follow the same steps as before, noting that $\hat{A}^\dagger$ now appears with 
a minus sign. The only difference this produces is to give 

$$\int_{-\infty}^{\infty} \psi_1^*(x)\left(\hat{A} - \hat{A}^\dagger\right)\psi_0(x)\,dx = -1.$$

You may choose to follow through all the details for yourself or, if you are short 
of time, simply accept the final result which is 

$$\langle p_x \rangle = \frac{-i\hbar}{\sqrt{2}\,a}\left[\tfrac{1}{2}e^{-i\omega_0t} - \tfrac{1}{2}e^{+i\omega_0t}\right] = -\frac{\hbar}{\sqrt{2}\,a}\sin(\omega_0t). \tag{6.16}$$

Remember that $\sin\theta = \left(e^{i\theta} - e^{-i\theta}\right)/2i$. 

Here, again, we see something reminiscent of classical behaviour: the expectation 
value of momentum oscillates sinusoidally with the angular frequency $\omega_0$ of the 
corresponding classical harmonic oscillator. 
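
The ladder-operator calculation translates directly into a few lines of code. The Python sketch below is an illustrative assumption rather than the course's own method. It stores a wave packet as a list of coefficients $a_n$, attaches the phase factors $e^{-i(n+1/2)\omega_0t}$, and uses Equation 6.12 to form $\langle \hat{A} \rangle = \sum_n c_{n-1}^*\sqrt{n}\,c_n$. Since $\langle \hat{A}^\dagger \rangle$ is the complex conjugate of $\langle \hat{A} \rangle$, Equation 6.11 and the momentum operator expression give $\langle x \rangle = \sqrt{2}\,a\,\mathrm{Re}\langle \hat{A} \rangle$ and $\langle p_x \rangle = (\sqrt{2}\,\hbar/a)\,\mathrm{Im}\langle \hat{A} \rangle$. The function name and the default units ($a = \hbar = 1$) are hypothetical choices.

```python
import cmath
import math


def expect_x_p(coeffs, phase, a=1.0, hbar=1.0):
    """Return (<x>, <p_x>) for a packet with coefficients a_n at phase = omega_0 * t."""
    # Time-dependent amplitudes c_n(t) = a_n * exp(-i (n + 1/2) omega_0 t).
    c = [a_n * cmath.exp(-1j * (n + 0.5) * phase) for n, a_n in enumerate(coeffs)]
    # <A> = sum over n of conj(c_{n-1}) * sqrt(n) * c_n, using A psi_n = sqrt(n) psi_{n-1}.
    mean_A = complex(sum(c[n - 1].conjugate() * math.sqrt(n) * c[n]
                         for n in range(1, len(c))))
    x = math.sqrt(2.0) * a * mean_A.real
    p = math.sqrt(2.0) * hbar / a * mean_A.imag
    return x, p


if __name__ == "__main__":
    equal_mix = [1 / math.sqrt(2.0), 1 / math.sqrt(2.0)]  # the packet of Equation 6.7
    for phase in (0.0, math.pi / 2, math.pi):
        print(phase, expect_x_p(equal_mix, phase))
```

For the equal mixture of the first two states, the output should follow $(a/\sqrt{2})\cos(\omega_0t)$ and $-(\hbar/\sqrt{2}\,a)\sin(\omega_0t)$, in line with Equations 6.14 and 6.16.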




Exercise 6.4 Show that the expectation values $\langle x \rangle$ and $\langle p_x \rangle$, given by 
Equations 6.14 and 6.16, satisfy 

$$\langle p_x \rangle = m\,\frac{d\langle x \rangle}{dt}.$$


6.1.4 More general harmonic-oscillator wave packets 


So far, we have considered a specific wave packet, with equal coefficients for 
the first two stationary states of a harmonic oscillator. You have seen that the 
probability density for position sloshes from one side of the well to the other with 
a period equal to $T$, the classical period of oscillation. Moreover, the expectation 
values of position and momentum vary sinusoidally, in close analogy with 
classical simple harmonic motion. It is natural to ask whether these features are 
true for all wave packets describing the state of a particle in a simple harmonic 
well. 


A general wave packet describing a particle in a harmonic oscillator is of the form 

$$\Psi(x,t) = \sum_{n=0}^{\infty} a_n\Psi_n(x,t) \quad\text{where}\quad \sum_{n=0}^{\infty} |a_n|^2 = 1. \tag{6.17}$$

In many cases, only a finite number of the $a_n$ are non-zero, but Equation 6.17 
covers all possibilities. Since $E_n = \left(n + \tfrac{1}{2}\right)\hbar\omega_0$, the harmonic-oscillator 
stationary states are 

$$\Psi_n(x,t) = \psi_n(x)\,e^{-iE_nt/\hbar} = \psi_n(x)\,e^{-(2n+1)i\omega_0t/2}.$$

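Now, it is helpful to remember that $\psi_0(x), \psi_2(x), \psi_4(x), \ldots$ are even functions of 
$x$, while $\psi_1(x), \psi_3(x), \psi_5(x), \ldots$ are odd functions of $x$. This allows us to write 
the wave function in Equation 6.17 as the sum of even and odd parts: 

$$\Psi(x,t) = \Psi_{\text{even}}(x,t) + \Psi_{\text{odd}}(x,t),$$

where 

$$\begin{aligned}
\Psi_{\text{even}}(x,t) &= a_0\psi_0(x)\,e^{-i\omega_0t/2} + a_2\psi_2(x)\,e^{-5i\omega_0t/2} + \ldots \\
\Psi_{\text{odd}}(x,t) &= a_1\psi_1(x)\,e^{-3i\omega_0t/2} + a_3\psi_3(x)\,e^{-7i\omega_0t/2} + \ldots
\end{aligned}$$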


At time $t = T/2 = \pi/\omega_0$, the phase factors in the even part of the wave packet 
become 

$$e^{-i\pi/2} = e^{-5i\pi/2} = \ldots = -i,$$

while the phase factors in the odd part of the wave packet become 

$$e^{-3i\pi/2} = e^{-7i\pi/2} = \ldots = +i.$$

We therefore have 

$$\Psi_{\text{even}}(x,T/2) = -i\,\Psi_{\text{even}}(x,0) \quad\text{and}\quad \Psi_{\text{odd}}(x,T/2) = +i\,\Psi_{\text{odd}}(x,0),$$

and hence 

$$\begin{aligned}
\Psi(x,T/2) &= -i\left[\Psi_{\text{even}}(x,0) - \Psi_{\text{odd}}(x,0)\right] \\
&= -i\left[\Psi_{\text{even}}(-x,0) + \Psi_{\text{odd}}(-x,0)\right],
\end{aligned}$$

giving 

$$|\Psi(x,T/2)|^2 = |\Psi(-x,0)|^2,$$

which is exactly what we discovered in Exercise 6.3 for the simple wave packet 
made up with just the first two energy eigenfunctions. 


This shows that, no matter what the initial form of the wave packet, the 
probability density at time t = T /2 becomes a reflected version of the probability 
density at t = 0. If the wave packet was concentrated mainly on the right-hand 
side of the well at time t = 0, it becomes concentrated mainly on the left-hand 
side of the well at time t = T/2, where T is the classical period of oscillation. 
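
The reflection property can be tested for any set of coefficients. The Python sketch below is an illustration under stated assumptions: units with $a = 1$, the phase $\omega_0 t$ as the time variable, eigenfunctions generated from the standard Hermite recurrence, and an arbitrary four-term packet whose coefficients are chosen only so that $\sum |a_n|^2 = 1$. The function names are hypothetical.

```python
import cmath
import math


def psi(n, y):
    """Normalized oscillator eigenfunction psi_n, in units where a = 1 (y = x/a)."""
    h_prev, h = 0.0, 1.0  # placeholder for H_{-1}, and H_0 = 1
    for k in range(n):
        # Hermite recurrence: H_{k+1}(y) = 2 y H_k(y) - 2 k H_{k-1}(y)
        h_prev, h = h, 2.0 * y * h - 2.0 * k * h_prev
    norm = 1.0 / math.sqrt(2.0 ** n * math.factorial(n) * math.sqrt(math.pi))
    return norm * h * math.exp(-0.5 * y * y)


def packet(coeffs, y, phase):
    """Wave packet of Equation 6.17 at y = x/a, with phase = omega_0 * t."""
    return sum(a_n * psi(n, y) * cmath.exp(-1j * (n + 0.5) * phase)
               for n, a_n in enumerate(coeffs))


if __name__ == "__main__":
    coeffs = [0.5, 0.5j, -0.5, 0.5]  # illustrative: each |a_n|^2 = 0.25
    for y in (-1.3, 0.2, 2.0):
        half_period = abs(packet(coeffs, y, math.pi)) ** 2
        reflected = abs(packet(coeffs, -y, 0.0)) ** 2
        print(y, half_period, reflected)
```

The last two columns should agree for every $y$, whatever coefficients are supplied, because the argument in the text does not depend on the particular values of the $a_n$.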


A similar analysis shows that the wave function changes sign after a time $T$, so 
that $\Psi(x,T) = -\Psi(x,0)$, but this change in sign has no physical consequences 
because $|\Psi(x,T)|^2 = |\Psi(x,0)|^2$. After two classical periods, the wave function 
returns to its initial value: $\Psi(x,2T) = \Psi(x,0)$. We therefore conclude that the 
probability density describing the position of the particle generally sloshes from 
one side of the well to the other, endlessly repeating this motion at the classical 
period $T = 2\pi/\omega_0$. This typical motion is shown in Figure 6.2, which shows the 
real and imaginary parts of the wave function, together with the probability 
density $|\Psi|^2$, at a number of different times during the classical period. 


Figure 6.3a shows the time-dependence of $\langle x \rangle$, and Figure 6.3b shows the 
time-dependence of $\langle p_x \rangle$, for the wave packet shown in Figure 6.2. Both these 
quantities vary sinusoidally with time, in agreement with our findings for the 
two-component wave packet discussed earlier. Remember from Equations 6.14 
and 6.16 that $\langle x \rangle$ and $\langle p_x \rangle$ have the same period, $T$, as the classical oscillator. 
Comparing the two parts of this figure, you can see that $\langle p_x \rangle$ is zero when $\langle x \rangle$ is at 
one of its extreme values. This is analogous to the motion of a pendulum which 
comes instantaneously to rest at the extreme points of its swing. 


The error bars in Figure 6.3a indicate the uncertainties in position at selected 
times, while those in Figure 6.3b show the uncertainties in momentum. You can 
see that $\Delta x$ is largest when $\Delta p_x$ is smallest. By comparing Figures 6.2 and 6.3, 
you can also see that $\Delta x$ is largest when the wave packet is visibly broadest. It is 
as if the wave packet is breathing in and out as it moves to and fro. 


It is interesting to note that wave packets can also breathe in and out without 
moving to and fro. This happens if the wave function is initially symmetric (or 
antisymmetric) about the centre of the well. Such wave functions always remain 
symmetric (or antisymmetric) and so do not oscillate from one side of the well 
to the other. For such a wave packet, the expectation values of position and 
momentum are permanently equal to zero. 
Figure 6.3 The time-dependence of (a) the expectation value $\langle x \rangle$ of position 
and (b) the expectation value $\langle p_x \rangle$ of momentum for the wave packet of 
Figure 6.2. At selected times the uncertainties $\Delta x$ and $\Delta p_x$ are indicated by 
error bars. 


Figure 6.2 Snapshots of a wave packet in a harmonic well at times $t = 0$, 
$t = T/8$, $t = T/4$, $t = 3T/8$, $t = T/2$, $t = 5T/8$, $t = 3T/4$, $t = 7T/8$ 
and $t = T$, where $T = 2\pi/\omega_0$ is the classical period of oscillation. The real part 
of the wave function is shown in blue, the imaginary part in green and the 
probability density $|\Psi|^2$ in black. The wave function changes sign after one 
classical period, but this has no observable consequences. 


Figure 6.4 Paul Ehrenfest 
(1880-1933). Ehrenfest was a 
student of Boltzmann and a 
close friend of Einstein. He 
proved his famous theorem in 
1927. 
Exercise 6.5 A wave packet in a harmonic potential energy well takes the form 

$$\Psi(x,t) = a_1\psi_1(x)\,e^{-3i\omega_0t/2} + a_3\psi_3(x)\,e^{-7i\omega_0t/2},$$

where $a_1$ and $a_3$ are constants. Show that, at all times, the probability density 
$|\Psi(x,t)|^2$ is symmetrical about the centre of the well and the expectation value 
of position is equal to zero. What type of motion does this describe? 


6.2 Ehrenfest’s theorem and the classical limit 


You have seen several examples of wave packets that describe the motion of 
particles in a harmonic potential energy well. These examples suggest that the 
expectation values of position and momentum vary sinusoidally with time (as in 
Figure 6.3) or remain constant in time (as in Exercise 6.5). This behaviour is 
intriguing because we know that a classical harmonic oscillator has values of 
position and momentum that vary sinusoidally in time, or remain constant. 


This section will introduce an important theorem due to Paul Ehrenfest 
(Figure 6.4) which sheds light on this apparent coincidence. Later in the course, 
we shall prove Ehrenfest’s theorem from general quantum-mechanical principles. 
It would be too much of a diversion to include the proof in this chapter, so we 
shall simply state the result here, and concentrate on its interpretation. You will 
see that Ehrenfest’s theorem provides a key link from the quantum world to the 
more familiar everyday world of classical physics. 


6.2.1 Ehrenfest’s theorem 


Ehrenfest’s theorem is a general statement about the rates of change of $\langle x \rangle$ and 
$\langle p_x \rangle$ in quantum-mechanical systems. It can be stated as follows: 


Ehrenfest’s theorem 


If a particle of mass $m$ is in a state described by a normalized wave function 
in a system with a potential energy function $V(x)$, the expectation values of 
position and momentum of the particle obey the equations 

$$\frac{d\langle x \rangle}{dt} = \frac{\langle p_x \rangle}{m},$$

$$\frac{d\langle p_x \rangle}{dt} = -\left\langle \frac{\partial V}{\partial x} \right\rangle.$$


So far, in this chapter, we have concentrated on wave packets in harmonic 
oscillators, but it is important to note that Ehrenfest’s theorem is a completely 
general result, valid throughout quantum mechanics. Moreover, it is an exact 
result — it does not rely on any approximations. 


Perhaps the most obvious feature of Ehrenfest’s theorem is that it looks very like 
Newtonian mechanics. The first Ehrenfest equation (which we label E1) can be 
rearranged to give 

$$\langle p_x \rangle = m\,\frac{d\langle x \rangle}{dt}, \tag{6.18}$$

which is similar to the familiar Newtonian equation, $p_x = m\,dx/dt$. The second 
Ehrenfest equation (which we label E2) can be expressed as 

$$\frac{d\langle p_x \rangle}{dt} = \langle F_x \rangle, \tag{6.19}$$

where we have defined $F_x = -\partial V/\partial x$, in line with classical physics. So E2 is the 
quantum-mechanical analogue of Newton’s second law equating the rate of 
change of momentum to the applied force. However, there is one important 
difference between E1 and E2 and Newtonian mechanics. Ehrenfest’s theorem 
involves expectation values, while the corresponding Newtonian equations deal 
with the actual values of position, momentum and force. In quantum mechanics 
the description is necessarily less precise because the particle cannot have precise 
values of both position and momentum, according to the uncertainty principle. 


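We shall examine the precise relationship between Ehrenfest’s theorem and 
Newtonian mechanics shortly. Before doing so, it is worth looking at some 
examples of Ehrenfest’s theorem in action, beginning with the case that has been 
discussed at length in previous sections: a particle of mass $m$ in a harmonic 
potential energy well $V(x) = \tfrac{1}{2}Cx^2 = \tfrac{1}{2}m\omega_0^2x^2$. In this case, we have 

$$\frac{\partial V}{\partial x} = \frac{\partial}{\partial x}\left(\tfrac{1}{2}m\omega_0^2x^2\right) = m\omega_0^2x. \tag{6.20}$$

Differentiating E1 with respect to time and then using E2 and Equation 6.20 we 
obtain 

$$\frac{d^2\langle x \rangle}{dt^2} = \frac{1}{m}\frac{d\langle p_x \rangle}{dt} = -\frac{1}{m}\left\langle \frac{\partial V}{\partial x} \right\rangle = -\frac{1}{m}\left\langle m\omega_0^2x \right\rangle = -\omega_0^2\langle x \rangle,$$

which gives 

$$\frac{d^2\langle x \rangle}{dt^2} + \omega_0^2\langle x \rangle = 0. \tag{6.21}$$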


This is the usual equation of motion for a classical harmonic oscillator, but 
applied to $\langle x \rangle$ rather than to $x$. The general solution of this differential equation 
can be written as 

$$\langle x \rangle = A\sin(\omega_0t + \phi), \tag{6.22}$$

where $A$ and $\phi$ are arbitrary constants. The expectation value of the momentum 
then follows immediately from E1: 

$$\langle p_x \rangle = m\,\frac{d\langle x \rangle}{dt} = m\omega_0A\cos(\omega_0t + \phi). \tag{6.23}$$

Equations 6.22 and 6.23 confirm the sinusoidal time-dependences of $\langle x \rangle$ and $\langle p_x \rangle$ 
found in the particular wave packet of Equations 6.14 and 6.16, and the more 
general case illustrated in Figure 6.3. You may recall that symmetric wave packets 
in a harmonic potential energy well do not move to and fro, but breathe in and out 
with $\langle x \rangle$ and $\langle p_x \rangle$ permanently equal to zero. Such behaviour is also consistent 
with Ehrenfest’s theorem; Equations 6.22 and 6.23 continue to apply, but with the 
constant $A$ equal to zero. 
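
For the harmonic well, E1 and E2 form a closed pair of equations for $\langle x \rangle$ and $\langle p_x \rangle$, so they can be stepped forward in time like any classical equation of motion. The Python sketch below is a minimal illustration of this, not a method taken from the course: it uses a leapfrog (velocity Verlet) scheme, which is an assumed choice of integrator, and compares the result with Equations 6.22 and 6.23. The function names, default parameter values and step count are hypothetical.

```python
import math


def integrate_ehrenfest(x0, p0, m=1.0, omega0=1.0, t_end=1.0, steps=10000):
    """Leapfrog integration of d<x>/dt = <p_x>/m and d<p_x>/dt = -m omega0^2 <x>."""
    dt = t_end / steps
    x, p = x0, p0
    for _ in range(steps):
        p -= 0.5 * dt * m * omega0 ** 2 * x  # half step in momentum (E2)
        x += dt * p / m                      # full step in position (E1)
        p -= 0.5 * dt * m * omega0 ** 2 * x  # second half step in momentum
    return x, p


def analytic(x0, p0, t, m=1.0, omega0=1.0):
    """Equations 6.22 and 6.23, with A and phi fixed by the initial values."""
    A = math.hypot(x0, p0 / (m * omega0))
    phi = math.atan2(x0, p0 / (m * omega0))
    return (A * math.sin(omega0 * t + phi),
            m * omega0 * A * math.cos(omega0 * t + phi))


if __name__ == "__main__":
    start = (1 / math.sqrt(2.0), 0.0)  # initial <x> and <p_x> of Equations 6.14 and 6.16
    print(integrate_ehrenfest(*start, t_end=2.0))
    print(analytic(*start, 2.0))
```

The two printed pairs should agree closely. This closure relies on the force being linear in $x$, so that $\langle \partial V/\partial x \rangle$ can be written in terms of $\langle x \rangle$ alone.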
Exercise 6.6 A particle of mass $m$ is subject only to the downward force of 
gravity. With the $x$-axis pointing vertically upwards, the potential energy function 
is $V(x) = mgx$, where $g$ is the magnitude of the acceleration due to gravity. 
What can be said about the expectation value of $x$ for this particle? 


From the above examples, you might be tempted to suppose that the
time-dependences of $\langle x\rangle$ and $\langle p_x\rangle$ in quantum physics are always identical to the
time-dependences of $x(t)$ and $p_x(t)$ in classical physics. If so, you will need to
think again! To show that this need not be so, we consider a particle of mass $m$
subject to a potential energy function $V(x) = Ax^4$, where $A$ is a constant. In this
case,

\[ \frac{\partial V}{\partial x} = \frac{\partial}{\partial x}\left(Ax^4\right) = 4Ax^3, \]

and Ehrenfest’s theorem gives

\[ \frac{d^2\langle x\rangle}{dt^2} = \frac{1}{m}\frac{d\langle p_x\rangle}{dt} = -\frac{1}{m}\left\langle\frac{\partial V}{\partial x}\right\rangle = -\frac{4A}{m}\langle x^3\rangle. \]

Now, there is no reason to suppose that $\langle x^3\rangle$ is the same as $\langle x\rangle^3$. In general, these
quantities are different, though the precise details will depend on the nature of the
wave packet. This means that $\langle x\rangle$ satisfies a different differential equation from
$x(t)$ in classical physics, so there is no reason to think that the time-dependences
of $\langle x\rangle$ and $x(t)$ will be exactly the same.
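
To make the difference between $\langle x^3\rangle$ and $\langle x\rangle^3$ concrete, here is a small sketch that evaluates both for one particular wave packet. The Gaussian probability density, its centre and its width are assumptions chosen for illustration; for such a density centred on $X$ with standard deviation $\sigma$, the exact result is $\langle x^3\rangle = X^3 + 3X\sigma^2$.

```python
import math

def gaussian_density(x, centre, sigma):
    """Assumed probability density |Psi|^2: a Gaussian of standard deviation sigma."""
    return math.exp(-(x - centre)**2 / (2 * sigma**2)) / (sigma * math.sqrt(2 * math.pi))

def expectation(func, centre, sigma, n=4001, span=10.0):
    """Trapezoidal estimate of <func(x)> over centre +/- span*sigma."""
    start = centre - span * sigma
    h = 2 * span * sigma / (n - 1)
    total = 0.0
    for i in range(n):
        x = start + i * h
        weight = 0.5 if i in (0, n - 1) else 1.0
        total += weight * func(x) * gaussian_density(x, centre, sigma)
    return total * h

centre, sigma = 1.0, 0.5            # illustrative values, arbitrary units
mean_x = expectation(lambda x: x, centre, sigma)
mean_x_cubed = expectation(lambda x: x**3, centre, sigma)
print(mean_x**3, mean_x_cubed)      # <x>^3 compared with <x^3>
```
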


6.2.2 The classical limit of quantum mechanics 


Quantum mechanics is believed to be a more accurate description of the world 
than classical mechanics. This is certainly true for systems on the scale of atoms 
and nuclei, for which classical physics runs into severe difficulties. However, we 
know that classical physics still provides an excellent description of macroscopic 
phenomena, on the scale of footballs, cars or planets. No one would think of 
solving Schrödinger’s equation to predict the motion of a planet; it is much easier,
and perfectly satisfactory, to use classical physics for this. Nevertheless, a point of 
principle remains: if quantum physics is a truly universal theory, it should be able 
to explain all phenomena — the motion of a planet as well as the motion of an 
electron. This suggests that the predictions of quantum mechanics should reduce 
to those of classical mechanics in a suitable ‘classical limit’. In rough terms, this 
classical limit corresponds to bodies that are very much larger than atoms. 


Ehrenfest’s theorem helps explain how this classical limit is approached. 
Although E1 and E2 look very similar to the basic equations of Newtonian 
mechanics, you have seen that there is one important difference: Ehrenfest’s 
equations involve expectation values while Newton’s equations do not. In the 
classical limit, this difference must become unimportant. In taking the classical 
limit we need to make some approximations. The approximations are not part of 
Ehrenfest’s theorem, but they are needed to bring the predictions of Ehrenfest’s 
theorem into line with those of classical physics. 


The first point to note is that classical physics assumes that a particle has a 
definite position and momentum at each instant in time. In the classical limit, the 
uncertainties of position and momentum associated with quantum-mechanical 
wave packets must therefore be negligible. This is not hard to achieve:

a wave packet describing the centre of mass of a pendulum bob could have 

Av ~ 10- mand Ap, œ 107}? kg ms™t. These uncertainties are consistent 
with the Heisenberg uncertainty principle, but we cannot measure the position or 
momentum of the bob to such precisions so, from a practical point of view, the 
uncertainties are insignificant. 


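We also need to consider the expectation value $\langle\partial V/\partial x\rangle$. Combining E1 and E2
we obtain

\[ m\frac{d^2\langle x\rangle}{dt^2} = -\left\langle\frac{\partial V}{\partial x}\right\rangle. \tag{6.24} \]

For a very narrow wave packet, centred on the point $x = X$ at time $t$, the
expectation value $\langle x\rangle$ is essentially equal to $X$, and this can be interpreted
classically as ‘the position of the particle’. Writing the partial derivative $\partial V/\partial x$
as $V'(x)$, the right-hand side of Equation 6.24 is given by the sandwich integral

\[ -\left\langle\frac{\partial V}{\partial x}\right\rangle = -\int_{-\infty}^{\infty}\Psi^*(x,t)\,V'(x)\,\Psi(x,t)\,dx. \]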


Because the wave packet is very narrow, only the region around $x = X$ makes a
significant contribution to this integral. Provided that $V'(x)$ varies slowly enough
across this region, we can approximate $V'(x)$ by the constant value $V'(X)$, and
then take this constant value outside the integral. Then, because the wave function
is normalized we obtain

\[ -\left\langle\frac{\partial V}{\partial x}\right\rangle = -V'(X)\int_{-\infty}^{\infty}\Psi^*(x,t)\,\Psi(x,t)\,dx = -V'(X). \]

This can be thought of as the force on the particle at position $X$. Equation 6.24 is
then interpreted as Newton’s second law: mass × acceleration = applied force.

The important point to remember is that: 


The classical limit applies to wave packets with narrow spreads of position
and momentum subject to potential energy functions for which $\partial V/\partial x$
varies slowly over the width of the wave packet.


These conditions are generally met for macroscopic objects (which have very 
short de Broglie wavelengths) but are seldom valid for microscopic particles such 
as atoms or electrons. 
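
The approach to the classical limit can be illustrated numerically. The sketch below is an illustration under stated assumptions, not a general proof: it takes the quartic potential $V(x) = Ax^4$ with $A = 1$, assumes a Gaussian probability density centred on $X = 1$, and compares $-\langle V'(x)\rangle$ with the classical force $-V'(X)$ as the width of the packet is reduced. All names and values are hypothetical and in arbitrary units.

```python
import math

def mean_force(v_prime, X, sigma, n=4001, span=10.0):
    """Return -<V'(x)> for a Gaussian probability density centred on X.

    The density |Psi|^2 is assumed Gaussian with standard deviation sigma.
    """
    start = X - span * sigma
    h = 2 * span * sigma / (n - 1)
    total = 0.0
    for i in range(n):
        x = start + i * h
        density = math.exp(-(x - X)**2 / (2 * sigma**2)) / (sigma * math.sqrt(2 * math.pi))
        total += v_prime(x) * density * h
    return -total

A = 1.0                              # assumed strength of V(x) = A x^4
v_prime = lambda x: 4 * A * x**3     # dV/dx for the quartic potential
X = 1.0                              # centre of the packet

relative_errors = []
for sigma in (0.5, 0.05, 0.005):
    quantum = mean_force(v_prime, X, sigma)
    classical = -v_prime(X)
    relative_errors.append(abs(quantum - classical) / abs(classical))
print(relative_errors)               # shrinks as the packet narrows
```

For this particular potential and density the relative difference is $3\sigma^2/X^2$, so it falls by a factor of 100 each time the width is reduced tenfold.
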


6.3 Predicting the motion of a wave packet 


We now turn to consider the problem of predicting the time-development of a 
wave packet. This is analogous to the classical problem of releasing a particle 
with some initial position and velocity, and predicting its future motion. 


To take a definite case, we shall continue to discuss a particle of mass m in a 
harmonic potential energy well. Suppose that, at time t = 0, the state of the 
particle is described by a linear combination of harmonic-oscillator energy 
eigenfunctions 


\[ \Psi(x,0) = a_0\psi_0(x) + a_1\psi_1(x) + \ldots \tag{6.25} \]
where $\psi_n(x)$ is an energy eigenfunction of the harmonic oscillator with
eigenvalue $E_n = (n + \tfrac{1}{2})\hbar\omega_0$. Then, we can ask what the wave function will be at
some later time $t$. We shall assume (correctly) that there is a unique answer to this
question. In other words, if we know the wave function at time $t = 0$, we assume
that the time-development of the wave function is completely determined by
Schrödinger’s equation.


The assumption of a unique answer is liberating. It means that, if we can write
down a wave function $\Psi(x,t)$ that (i) satisfies Schrödinger’s equation for the
given system and (ii) is equal to the known form of the wave function at time
$t = 0$, then this must be the wave function of the system; there can be no other.
Now, it is easy to see that the wave function

\[ \Psi(x,t) = a_0\psi_0(x)\,e^{-iE_0t/\hbar} + a_1\psi_1(x)\,e^{-iE_1t/\hbar} + \ldots \tag{6.26} \]

is a solution of Schrödinger’s equation. This is because each term in the sum is a
stationary-state wave function that satisfies Schrödinger’s equation and, by the
principle of superposition, any linear combination of solutions of Schrödinger’s
equation is itself a solution. What is more, if we substitute $t = 0$ in Equation 6.26,
we recover Equation 6.25, so the particular linear combination of stationary-state
wave functions in Equation 6.26 is equal to the known wave function at time
$t = 0$. Since conditions (i) and (ii) are met, uniqueness allows us to conclude that
the wave function in Equation 6.26 describes the state of the particle at all times.


Now let us consider a slightly different case. Suppose that, at time $t = 0$, the state
of the particle is described by the wave function $\Psi(x,0) = f(x)$, where $f(x)$ is a
known function, such as that in Figure 6.5.


Figure 6.5 The wave function at $t = 0$ is $\Psi(x,0) = f(x)$, centred at point $x_0$.


How can we predict the time-development of the wave function in this case? We
do this by assuming that it is possible to write $f(x)$ as a linear combination of
energy eigenfunctions. That is, we write

\[ f(x) = a_0\psi_0(x) + a_1\psi_1(x) + \ldots = \sum_{i=0}^{\infty} a_i\psi_i(x). \tag{6.27} \]

Then, by the above argument, the wave function we are looking for is given by
Equation 6.26. The only snag is, we do not know what the coefficients $a_0, a_1, \ldots$
are.


● What is the physical significance of the coefficient $a_i$?

○ It is the probability amplitude for obtaining the energy eigenvalue $E_i$ in the
initial state $\Psi(x,0) = f(x)$. The probability of obtaining the energy $E_i$ is
$|a_i|^2$.

To find the coefficients $a_0, a_1, \ldots$ we can use a method introduced in Chapter 4.
We multiply both sides of Equation 6.27 by $\psi_j^*(x)$ and integrate over all $x$. This
gives

\[ \int_{-\infty}^{\infty}\psi_j^*(x)\,f(x)\,dx = \sum_{i=0}^{\infty} a_i\int_{-\infty}^{\infty}\psi_j^*(x)\,\psi_i(x)\,dx. \tag{6.28} \]

We then use the fact that the energy eigenfunctions are orthonormal. Expressed in
terms of the Kronecker delta symbol, this means that

\[ \int_{-\infty}^{\infty}\psi_j^*(x)\,\psi_i(x)\,dx = \delta_{ji}. \tag{6.29} \]

Inserting this result into Equation 6.28 and noting that the Kronecker delta kills
off all the terms on the right-hand side except the term with $i = j$, we obtain

\[ \int_{-\infty}^{\infty}\psi_j^*(x)\,f(x)\,dx = a_j. \]

Turning this equation around, replacing $j$ by $i$, and recalling that $f(x) = \Psi(x,0)$,
we conclude that

\[ a_i = \int_{-\infty}^{\infty}\psi_i^*(x)\,\Psi(x,0)\,dx. \tag{6.30} \]

You may recognize this equation. It tells us that the probability amplitude for
measuring energy $E_i$ in the initial state is given by the overlap integral of the $i$th
energy eigenfunction with the initial wave function. In the context of our present
discussion, however, the importance of this result is that it gives us a way of
finding the coefficients needed in the expansion of the wave function.


We have now solved the following basic problem in quantum mechanics:
If a system starts in a given state, described by an initial wave function
$\Psi(x,0) = f(x)$, how does the wave function evolve in the future? The answer is
given by the following three-step procedure:


1. Expand the initial wave function as a sum of energy eigenfunctions
(Equation 6.27).


2. Use the overlap integral to find the coefficients $a_i$ in the sum (Equation 6.30).


3. Construct the wave function at an arbitrary time $t$ by including the
appropriate time-dependent phase factors (Equation 6.26). The energy
eigenfunctions in Step 1 are replaced by stationary-state wave functions to
give

\[ \Psi(x,t) = \sum_{i=0}^{\infty} a_i\,\psi_i(x)\,e^{-iE_it/\hbar}. \tag{6.31} \]

The energy eigenfunctions used in Step 1 are, of course, those appropriate to the
system under study: products of polynomials and Gaussian functions for a
harmonic oscillator, sinusoidal functions for an infinite square well, and so
on. In general, the task of evaluating the coefficients is lengthy, and is not
something you will be asked to do. However, the coefficients can be evaluated
quite painlessly using a computer, and this is how data for the graphs in Figure 6.2
were calculated.
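
As an indication of how such a computer calculation might go, here is a minimal sketch of the three-step procedure for a harmonic oscillator. It is not the program used for Figure 6.2. It assumes units in which $\hbar = m = \omega_0 = 1$, takes as a hypothetical initial state a real Gaussian displaced to $x_0 = 1$, truncates the sum at 20 terms, and replaces the overlap integrals by sums over a grid. The eigenfunctions are generated from the standard recurrence relation for normalized Hermite functions.

```python
import cmath
import math

def eigenfunctions(x, n_max):
    """Values psi_0(x) ... psi_{n_max-1}(x) in units where hbar = m = omega0 = 1."""
    psi = [math.pi**-0.25 * math.exp(-0.5 * x * x)]
    for n in range(n_max - 1):
        previous = psi[n - 1] if n > 0 else 0.0
        psi.append(math.sqrt(2.0 / (n + 1)) * x * psi[n]
                   - math.sqrt(n / (n + 1)) * previous)
    return psi

def expand(f, xs, h, n_max):
    """Steps 1 and 2: a_i = integral of psi_i(x) f(x) dx (psi_i is real here)."""
    coeffs = [0.0] * n_max
    for x in xs:
        fx = f(x)
        for i, psi_i in enumerate(eigenfunctions(x, n_max)):
            coeffs[i] += psi_i * fx * h
    return coeffs

def wave_function(coeffs, x, t):
    """Step 3: sum of a_i psi_i(x) exp(-i E_i t / hbar) with E_i = i + 1/2."""
    psi = eigenfunctions(x, len(coeffs))
    return sum(a * p * cmath.exp(-1j * (i + 0.5) * t)
               for i, (a, p) in enumerate(zip(coeffs, psi)))

def mean_position(coeffs, xs, h, t):
    """<x> at time t, from the probability density |Psi(x,t)|^2."""
    return sum(x * abs(wave_function(coeffs, x, t))**2 for x in xs) * h

x0 = 1.0                                    # assumed initial displacement
f = lambda x: math.pi**-0.25 * math.exp(-0.5 * (x - x0)**2)
h = 0.02
xs = [-10.0 + i * h for i in range(1001)]
coeffs = expand(f, xs, h, n_max=20)
print(sum(abs(a)**2 for a in coeffs))       # close to 1 if enough terms are kept
print(mean_position(coeffs, xs, h, 0.0), mean_position(coeffs, xs, h, math.pi))
```

The sum of $|a_i|^2$ is a useful check that enough terms have been kept. Half a classical period later ($t = \pi$ in these units) the packet should be centred on $-x_0$, in line with the sinusoidal behaviour of $\langle x\rangle$ required by Ehrenfest’s theorem.
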


Finally, one mathematical detail must be cleared up. We have assumed that it is 
possible to expand the initial wave function f(x) in terms of the appropriate 
energy eigenfunctions. You might wonder whether this assumption is justified. It 
turns out that any reasonable function f(x) can be expanded as a linear 
combination of energy eigenfunctions, although infinitely many terms may be 
needed in general. Mathematicians have proved this fact in various cases, and it 
also makes good physical sense because the coefficients in the sum have the 
physical meaning of being probability amplitudes for the various possible energy 
eigenvalues. In mathematical language, the set of energy eigenfunctions is said to 
form a complete set — which is just another way of saying that any reasonable 
function can be expanded in terms of them. 


Exercise 6.7 We stated earlier that if a wave packet in a harmonic potential 
energy well is initially symmetric about the centre of the well, it will remain 
symmetric at all times. Use Equations 6.26 and 6.30 to confirm this fact.


Comparison with classical physics 


In classical physics, the initial state of a particle is specified by its initial position 
and momentum. Newton’s laws then allow us to deduce the position and 
momentum of the particle at all future times. For example, we can move a 
pendulum bob to one side and release it from rest; Newton’s laws will predict the 
subsequent to-and-fro motion. 


In quantum mechanics, the initial state of a particle is specified by giving its initial 
wave function. Schrödinger’s equation then allows us to deduce the wave function 
of the particle at all future times. The methods used above may have disguised the 
fact that Schrödinger’s equation is involved, but this is only because we have 
found a crafty way of solving Schrödinger’s equation, using the principle of 
superposition to represent the solution as a linear combination of stationary-state 
wave functions. 


It is interesting to compare the ways in which the initial conditions are specified. 
For a particle in one dimension, Newtonian mechanics requires two real numbers, 
$x(0)$ and $p_x(0)$, but quantum mechanics requires much more information: a
function $\Psi(x, 0)$, which may be complex. It is worth reflecting on why this is so.
All that can happen to a classical particle is that it can move from one point to 
another, but a quantum-mechanical wave packet undergoes a more complicated 
evolution in which the whole probability distribution shifts with time, moving, 
spreading and possibly developing peaks and troughs; this more complicated 
time-development is related to more complicated initial conditions. 


In classical physics, we can release a particle at rest from an initial position
$x = x_0$, and then calculate its subsequent motion in a harmonic well. What is the
quantum version of this calculation? The closest we can get is to consider a
narrow wave packet that is initially centred on the point $x = x_0$ (as in Figure 6.5).
To correspond to starting from rest, we assume that the expectation value of
momentum is equal to zero at the instant of release. We can then use the methods
outlined above to find the coefficients of different harmonic-oscillator energy
eigenfunctions in the initial wave packet, and construct a linear combination of
stationary-state wave functions that tells us how the wave packet develops in time.
In the classical limit, when the wave packet has extremely narrow spreads
of position and momentum, the detailed shape of the wave packet becomes
unimportant and all that really matters is $\langle x\rangle$ and $\langle p_x\rangle$. In this limit, the
predictions of quantum physics become indistinguishable from those of classical
physics, as you saw earlier in the discussion of Ehrenfest’s theorem.


One detail is worth further thought. How can we ensure that an initial wave
function has $\langle p_x\rangle = 0$, corresponding to the classical notion of releasing a particle
from rest? One way of achieving this is to take the initial wave function to be
real, as we shall now show. Let the initial wave function be $\Psi(x,0) = f(x)$,
where $f(x)$ is a normalized real function. Then the initial expectation value of
momentum is

\[ \langle p_x\rangle = \int_{-\infty}^{\infty}\Psi^*(x,0)\left(-i\hbar\frac{\partial}{\partial x}\right)\Psi(x,0)\,dx = -i\hbar\int_{-\infty}^{\infty} f(x)\,\frac{\partial f}{\partial x}\,dx, \]

where we have used the fact that $f(x)$ is real to take $f^*(x) = f(x)$. Since $f(x)$ is
a function of $x$ only, the partial derivative can be replaced by $df/dx$. Now we can
rewrite the integral as

\[ \langle p_x\rangle = -i\hbar\int_{-\infty}^{\infty}\frac{1}{2}\frac{d}{dx}\left[f^2(x)\right]dx, \]

which, on integration, gives

\[ \langle p_x\rangle = -\frac{i\hbar}{2}\Big[f^2(x)\Big]_{-\infty}^{\infty} = 0, \]

because $f^2(x)$ must go to zero at both $+\infty$ and $-\infty$ in order for $f(x)$ to
be normalized. It therefore follows that $\langle p_x\rangle = 0$ in any state described by a
real-valued wave function.
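
This result is easy to test numerically. The sketch below evaluates the sandwich integral for $\langle p_x\rangle$ with central differences, first for a real Gaussian and then for the same Gaussian multiplied by a phase factor $e^{ik_0x}$, which makes the wave function complex. The units ($\hbar = 1$), the choice of Gaussian and the value $k_0 = 2$ are assumptions made for illustration.

```python
import cmath
import math

HBAR = 1.0   # units with hbar = 1 (an assumption of this sketch)

def mean_momentum(psi, start=-12.0, stop=12.0, n=4801):
    """Sandwich integral of -i*hbar*d/dx, using central differences for d/dx."""
    h = (stop - start) / (n - 1)
    values = [complex(psi(start + i * h)) for i in range(n)]
    total = 0j
    for i in range(1, n - 1):
        derivative = (values[i + 1] - values[i - 1]) / (2 * h)
        total += values[i].conjugate() * (-1j * HBAR) * derivative * h
    return total

def real_packet(x):
    """A normalized real Gaussian centred on x = 1 (illustrative choice)."""
    return math.pi**-0.25 * math.exp(-0.5 * (x - 1.0)**2)

def moving_packet(x, k0=2.0):
    """The same packet multiplied by exp(i*k0*x), which makes it complex."""
    return real_packet(x) * cmath.exp(1j * k0 * x)

p_real = mean_momentum(real_packet)
p_moving = mean_momentum(moving_packet)
print(p_real, p_moving)   # essentially zero, and close to hbar*k0
```

The complex packet has the same probability density as the real one at $t = 0$, but a non-zero expectation value of momentum, which shows that the initial probability density alone does not fix the subsequent motion.
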


Choosing the initial wave function to be real is only one way of ensuring that the
wave packet starts from rest. In the case of a harmonic oscillator, any wave packet
that does not contain contributions from any neighbouring energy eigenfunctions
(such as $\psi_i(x)$ and $\psi_{i+1}(x)$) has zero expectation value of momentum, as the
following exercise illustrates.


Exercise 6.8 Show that the expectation value of momentum is equal to zero in
a state described by the wave function

\[ \Psi(x,0) = a_2\psi_2(x) + a_5\psi_5(x), \]

where $\psi_2(x)$ and $\psi_5(x)$ are energy eigenfunctions in a harmonic oscillator,
and $a_2$ and $a_5$ are complex constants.
Computer simulations: wave packets in wells


Now is the ideal time to study the software package Wave packets in wells. 
This is designed to help you visualize the wave packets we have been 
describing. Part of this package reviews the topics covered so far, but you 
will also be able to explore some additional topics in a fairly open-ended 
way. Here is a brief summary of some things you can do with this package: 


• Set up your own combinations of harmonic oscillator eigenfunctions to
explore how wave packets, probability densities and expectation values
of position and momentum vary with time in a harmonic oscillator.


• Study a special type of harmonic-oscillator wave packet (known as a
coherent-state wave packet) which behaves very much like a classical
particle.


• Compare the behaviour of wave packets in a harmonic oscillator
potential well with those in an infinite square well. There are many
similarities, but the differences include the fact that $\langle x\rangle$ and $\langle p_x\rangle$ in an
infinite square well do not always vary sinusoidally in time.


6.4 Free-particle wave packets 


Finally, we consider a wave packet that describes a free particle, i.e. one that is 
not subject to any force. At first sight, you might think that it would be trivial to 
describe the motion of a free particle, but this is not always so in quantum 
mechanics. We face two special difficulties: (i) the allowed energies of a free 
particle form a continuum and (ii) the energy eigenfunctions of a free particle 
cannot be normalized. Nevertheless, you will see that the approach used earlier to 
describe wave packets in a harmonic oscillator can be adapted to free particles, 
provided we judiciously replace sums by integrals. One of the things we will be 
able to do is explain how electrons passing through a narrow slit give rise to a 
characteristic single-slit diffraction pattern. 


6.4.1 Free-particle stationary states 


Before constructing wave packets for a free particle, we will first look at the 
building blocks from which these wave packets are made — the energy 
eigenfunctions and de Broglie wave functions of free particles. The Hamiltonian 
operator for a free particle in one dimension is 

\[ \hat{H} = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2}, \]

and the corresponding time-independent Schrödinger equation is

\[ -\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} = E\psi(x). \tag{6.32} \]



The energy eigenfunctions are functions that satisfy this equation together with
the required boundary conditions (i.e. not diverging as $x \to \pm\infty$). There are no
satisfactory solutions for $E < 0$, but solutions exist for all $E > 0$.


The solutions to Equation 6.32 can be expressed in various alternative forms. 
From your experience with an infinite square well, you might expect these 
solutions to be expressed as sines or cosines. However, for the purpose of 
describing a free particle, it is much more convenient to write the solutions as 
complex exponentials. That is, we take 


\[ \psi(x) = De^{ikx}, \]
where $D$ is a constant and $k$ can have any real value from $-\infty$ to $+\infty$. The


advantage of writing the eigenfunctions in this form is that they are eigenfunctions 
of momentum as well as energy. 


Worked Example 6.1

Essential skill: Verifying properties of momentum eigenfunctions

(a) Verify that $\psi(x) = De^{ikx}$ is an eigenfunction of momentum and
determine its eigenvalue.


(b) Show that $\psi(x)$ is also an eigenfunction of the free-particle Hamiltonian
operator and determine the corresponding eigenvalue, $E$.


(c) What is the degree of degeneracy of each energy level?


(d) Show explicitly that $\psi(x)$ cannot be normalized.


Solution
(a) Applying the momentum operator to the given function,

\[ -i\hbar\frac{\partial}{\partial x}De^{ikx} = -i\hbar(ik)De^{ikx} = \hbar k\,De^{ikx}, \]

so $De^{ikx}$ is an eigenfunction of momentum with eigenvalue $\hbar k$.


(b) Applying the free-particle Hamiltonian operator to the given function,

\[ -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2}De^{ikx} = -\frac{\hbar^2}{2m}(ik)^2De^{ikx} = \frac{\hbar^2k^2}{2m}De^{ikx}, \]

so $De^{ikx}$ is an eigenfunction of the free-particle Hamiltonian operator with
eigenvalue $E = \hbar^2k^2/2m$; this is just the kinetic energy associated with
momentum $\hbar k$.


(c) There are two values of $k$ for each energy eigenvalue $E = \hbar^2k^2/2m$.
These values are $k_1 = \sqrt{2mE}/\hbar$ and $k_2 = -\sqrt{2mE}/\hbar$. They correspond to
two different energy eigenfunctions, $De^{ik_1x}$ and $De^{ik_2x}$, characterized by
momenta of equal magnitudes but opposite directions along the x-axis. So
each energy level is doubly-degenerate.


(d) We evaluate

\[ \int_{-\infty}^{\infty}\left|De^{ikx}\right|^2dx = |D|^2\int_{-\infty}^{\infty}dx, \]

which is infinite. There is no choice of $D$ that will yield a finite (and
unity-valued) integral, so the function cannot be normalized.
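
Parts (a) to (c) can also be checked numerically. In the following sketch, which assumes units with $\hbar = m = 1$ and uses hypothetical function names, the derivatives are replaced by central differences and the result of applying each operator is divided by the function itself. If the function is an eigenfunction, the ratio should be the same constant at every point $x$.

```python
import cmath

HBAR, M = 1.0, 1.0      # assumed units with hbar = m = 1

def psi(x, k, D=1.0):
    """The function D*exp(ikx) studied in the worked example."""
    return D * cmath.exp(1j * k * x)

def momentum_eigenvalue(x, k, h=1e-5):
    """(-i hbar d/dx psi) / psi, with a central difference for d/dx."""
    derivative = (psi(x + h, k) - psi(x - h, k)) / (2 * h)
    return -1j * HBAR * derivative / psi(x, k)

def energy_eigenvalue(x, k, h=1e-4):
    """(-(hbar^2/2m) d^2/dx^2 psi) / psi, with a second central difference."""
    second = (psi(x + h, k) - 2 * psi(x, k) + psi(x - h, k)) / h**2
    return -(HBAR**2 / (2 * M)) * second / psi(x, k)

k = 3.0
print(momentum_eigenvalue(0.7, k), momentum_eigenvalue(0.7, -k))  # opposite momenta
print(energy_eigenvalue(0.7, k), energy_eigenvalue(0.7, -k))      # same energy
```
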
Although the function $De^{ikx}$ cannot be normalized, it is helpful to choose a
particular value for the constant $D$. While this cannot help with the normalization,
you will see that it simplifies some equations that will appear later. We choose

\[ \psi_k(x) = \frac{1}{\sqrt{2\pi}}\,e^{ikx}. \tag{6.33} \]

The index $k$ runs over a continuous range of real values, from $-\infty$ to $+\infty$. The
function $\psi_k(x)$ is an eigenfunction of momentum with eigenvalue $\hbar k$, and it is
also an eigenfunction of the free-particle Hamiltonian operator (an energy
eigenfunction) with eigenvalue $\hbar^2k^2/2m$. For brevity, we will generally refer to
$\psi_k(x)$ as a momentum eigenfunction.


The stationary-state wave function corresponding to the eigenfunction $\psi_k(x)$ is

\[ \Psi_k(x,t) = \psi_k(x)\,e^{-iE_kt/\hbar} \quad\text{where}\quad E_k = \frac{\hbar^2k^2}{2m}. \]

Using Equation 6.33, and combining exponentials, this can also be written as

\[ \Psi_k(x,t) = \frac{1}{\sqrt{2\pi}}\,e^{i(kx - \omega_kt)} \quad\text{where}\quad \omega_k = \frac{E_k}{\hbar} = \frac{\hbar k^2}{2m}. \]

Apart from the factor $1/\sqrt{2\pi}$, you should recognize this as a de Broglie wave
function with wave number $k$ and angular frequency $\omega_k$, corresponding to
momentum $\hbar k$ and energy $\hbar\omega_k$. The wavelength of the de Broglie wave is given
by $\lambda = 2\pi/k$, so high wave numbers correspond to short wavelengths.


The trouble with de Broglie wave functions is that they cannot be normalized, so 
we cannot define a sensible probability density for position, let alone describe 
how this probability density changes when a particle moves. Put bluntly, this 
means that de Broglie waves are not acceptable wave functions for a single 
particle. However, you will soon see that we can construct normalized wave 
packets, using de Broglie waves as building blocks. 


6.4.2 Constructing a free-particle wave packet 


For a harmonic oscillator, we constructed a wave packet by taking a linear
combination of harmonic-oscillator stationary states:

\[ \Psi(x,t) = \sum_{i=0}^{\infty} a_i\,\psi_i(x)\,e^{-iE_it/\hbar}, \]

where $\psi_i(x)$ is the $i$th harmonic-oscillator energy eigenfunction, with eigenvalue
$E_i$. By the principle of superposition, this wave packet satisfies Schrödinger’s
equation for the harmonic oscillator.


Now we shall do something very similar for a free particle, by taking a linear
combination of free-particle stationary states (de Broglie waves). In fact, it does
not help to take a discrete sum of de Broglie wave functions because such a sum
would not produce a normalizable wave function. Instead, we integrate over a
continuous set of de Broglie wave functions, each labelled by the continuous
index $k$ which represents the wave number. In other words, we consider the wave
packet

\[ \Psi(x,t) = \int_{-\infty}^{\infty} A(k)\,\psi_k(x)\,e^{-iE_kt/\hbar}\,dk, \]

where $A(k)$ is a complex function and $\psi_k(x)$ is a momentum eigenfunction,
which is also an energy eigenfunction with energy eigenvalue $E_k$. Using
Equation 6.33, and combining exponentials, the wave packet can be expressed as

\[ \Psi(x,t) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} A(k)\,e^{i(kx - E_kt/\hbar)}\,dk, \tag{6.34} \]

which is a linear superposition of de Broglie wave functions. Because each de
Broglie wave function is a stationary-state solution of the free-particle
Schrödinger equation, this linear combination must also satisfy the free-particle
Schrödinger equation, by the principle of superposition.


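Exercise 6.9 Given that the wave function $\Psi(x,t)$ in Equation 6.34 satisfies

\[ -\frac{\hbar^2}{2m}\frac{\partial^2\Psi}{\partial x^2} = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} A(k)\,\frac{\hbar^2k^2}{2m}\,e^{i(kx - E_kt/\hbar)}\,dk \]

and

\[ i\hbar\frac{\partial\Psi}{\partial t} = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} A(k)\,E_k\,e^{i(kx - E_kt/\hbar)}\,dk, \]

show that $\Psi(x,t)$ satisfies the free-particle Schrödinger equation.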


Now something rather wonderful happens. There is a famous mathematical result
known as Plancherel’s theorem that we can exploit. (Some authors call this
Parseval’s theorem.) The proof of this theorem is
difficult, and we shall not attempt any justification here. Suffice it to say that the
proof is a piece of pure mathematics that was given in 1910, before quantum
mechanics had been established. In the notation of Equation 6.34, Plancherel’s
theorem tells us that

\[ \int_{-\infty}^{\infty}|\Psi(x,0)|^2\,dx = \int_{-\infty}^{\infty}|A(k)|^2\,dk. \tag{6.35} \]

Consequently, if we choose the function $A(k)$ in such a way that

\[ \int_{-\infty}^{\infty}|A(k)|^2\,dk = 1, \tag{6.36} \]

then our wave packet will be normalized at $t = 0$; as always, this normalization
is preserved at later times. Such a choice of $A(k)$ is always possible and it
completely solves the problem of non-normalizable de Broglie wave functions. To
describe a free particle in quantum mechanics we must use a wave packet of the
form given in Equation 6.34 with a function $A(k)$ that satisfies Equation 6.36.
This normalized wave packet does not have a definite momentum or a definite
energy, but has a spread in both of these quantities.


6.4.3 Interpreting A(k) as a momentum amplitude 


What is the physical significance of the function $A(k)$ introduced above?
To explore this question, it is helpful to recall the interpretation of a
harmonic-oscillator wave packet:

\[ \Psi(x,t) = \sum_{i=0}^{\infty} a_i\,\psi_i(x)\,e^{-iE_it/\hbar}. \]

In this case, the coefficient $a_i$ is interpreted as the probability amplitude for
obtaining the value $E_i$ in the given wave packet, which means that the probability
of obtaining the energy $E_i$ is $|a_i|^2$.


Now, a very similar interpretation can be given to $A(k)$, but with a slight
difference because we are dealing with a continuous variable $k$, rather than a
discrete index $i$. In fact we can say that $|A(k)|^2\,\delta k$ is the probability of obtaining
a wave number in a narrow range of width $\delta k$, centred on $k$. We are usually more
interested in momentum than wave number, but these two quantities are related by
the simple formula $p_x = \hbar k$. Hence, the probability of finding the wave number to
be in a narrow range $\delta k$, centred on $k$, is the same as the probability of finding the
momentum $p_x$ to be in a narrow range $\hbar\,\delta k$, centred on $\hbar k$. We can therefore
make the following statement:


Born’s rule for momentum 


For a free-particle wave packet, the probability of finding the momentum of
the particle to lie in a small range $\hbar\,\delta k$, centred on $\hbar k$, is $|A(k)|^2\,\delta k$.


There is a strong similarity between this rule and Born’s rule for the wave
function, which interprets $|\Psi(x,t)|^2\,\delta x$ as the probability of finding the particle in
a small range $\delta x$, centred on $x$. For this reason, $A(k)$ is called the momentum
amplitude. ($A(k)$ is also called the momentum wave function.) This term emphasizes the central role of $|A(k)|^2$ in determining the
probability distribution of momentum. Note, however, that $|A(k)|^2$ is really a
probability density per unit wave number. The probability density per unit
momentum is $|A(k)|^2/\hbar$, which when multiplied by the small momentum range
$\hbar\,\delta k$, gives the probability $|A(k)|^2\,\delta k$.


While the expectation value $\langle p_x\rangle$ can always be found by evaluating the usual
sandwich integral

\[ \langle p_x\rangle = \int_{-\infty}^{\infty}\Psi^*(x,t)\,\hat{p}_x\,\Psi(x,t)\,dx, \]

it is also possible to calculate $\langle p_x\rangle$ directly from $A(k)$, based on Born’s rule for
momentum given above. The following exercise shows how this is done.


Exercise 6.10 How would you express the expectation values of the 
momentum and kinetic energy of a free-particle wave packet in terms of the 
momentum amplitude $A(k)$? Do these expectation values depend on time?


6.4.4 Predicting free-particle motion 


It is easy to predict the motion of a free particle in classical physics, but the
corresponding task is less straightforward in quantum mechanics. Given a wave
function $\Psi(x,0) = f(x)$ that describes the initial state of a free particle at time
$t = 0$, we need to predict how this wave function evolves in time. The method we
shall use is similar to that outlined earlier for harmonic-oscillator wave packets.


1. For a harmonic oscillator, our first step was to express the initial wave
function as a linear combination of harmonic-oscillator energy
eigenfunctions (Equation 6.25). For a free particle, we expand the initial
wave function $\Psi(x,0) = f(x)$ as a linear combination of momentum
eigenfunctions (which are also energy eigenfunctions):

\[ \Psi(x,0) = \int_{-\infty}^{\infty} A(k)\,\psi_k(x)\,dk = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} A(k)\,e^{ikx}\,dk. \tag{6.37} \]

An integral, rather than a sum, is needed here because $k$ can have any real
value between $-\infty$ and $+\infty$.


2. For a harmonic oscillator, the coefficients $a_i$ were determined by using an
overlap integral (Equation 6.30). The coefficients $A(k)$ in a free-particle
wave packet can be determined in a very similar way. It turns out that

\[ A(k) = \int_{-\infty}^{\infty}\psi_k^*(x)\,\Psi(x,0)\,dx = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty}\Psi(x,0)\,e^{-ikx}\,dx. \tag{6.38} \]


3. Finally, as for a harmonic oscillator, we construct the wave function $\Psi(x,t)$
by inserting the appropriate time-dependent phase factors. This has the
effect of replacing the momentum eigenstates in Equation 6.37 by de
Broglie wave functions:

\[ \Psi(x,t) = \int_{-\infty}^{\infty} A(k)\,\psi_k(x)\,e^{-iE_kt/\hbar}\,dk = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} A(k)\,e^{ikx}\,e^{-iE_kt/\hbar}\,dk = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} A(k)\,e^{i(kx - E_kt/\hbar)}\,dk, \tag{6.39} \]

where $E_k = \hbar^2k^2/2m$.


The steps in this procedure can all be rigorously justified using a branch of
mathematics called Fourier analysis, created at the beginning of the nineteenth
century by the French mathematician Jean Baptiste Fourier (Figure 6.6).
Mathematicians have shown that the set of functions $e^{ikx}$ is complete, so any
reasonable function $f(x)$ can be expanded in terms of them, as in Equation 6.37.
They have also shown that Equation 6.38 is a mathematical consequence of
Equation 6.37, independent of any physical interpretation. We follow the standard
mathematical convention of calling the right-hand side of Equation 6.38 the
Fourier transform of $\Psi(x,0)$ and calling the right-hand side of Equation 6.37
the inverse Fourier transform of $A(k)$.


Figure 6.6 Jean Baptiste Fourier (1768–1830) invented Fourier analysis to predict the
flow of heat.


Using this terminology we can state that:


• The momentum amplitude $A(k)$ is the Fourier transform of the initial
wave function $\Psi(x,0)$.


• The initial wave function $\Psi(x,0)$ is the inverse Fourier transform of the
momentum amplitude $A(k)$.
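
The free-particle procedure can be sketched on a computer by replacing the integrals in Equations 6.38 and 6.39 with sums over finite grids. The example below is only an illustration under stated assumptions: it uses units with $\hbar = m = 1$, a hypothetical initial state (a unit-width Gaussian multiplied by $e^{ik_0x}$ with $k_0 = 1$), and grid ranges wide enough that the neglected tails are very small. It checks that the resulting $A(k)$ satisfies Equation 6.36 and that the inverse transform reproduces the initial wave function.

```python
import cmath
import math

HBAR, M = 1.0, 1.0      # assumed units with hbar = m = 1

def momentum_amplitude(f, k, xs, dx):
    """Equation 6.38 as a Riemann sum over the grid xs."""
    return sum(f(x) * cmath.exp(-1j * k * x) for x in xs) * dx / math.sqrt(2 * math.pi)

def wave_packet(amplitudes, ks, dk, x, t):
    """Equation 6.39 as a Riemann sum, with E_k = hbar^2 k^2 / 2m."""
    total = 0j
    for A, k in zip(amplitudes, ks):
        E_k = HBAR**2 * k**2 / (2 * M)
        total += A * cmath.exp(1j * (k * x - E_k * t / HBAR))
    return total * dk / math.sqrt(2 * math.pi)

def f(x, k0=1.0):
    """Illustrative initial state: a unit-width Gaussian with mean wave number k0."""
    return math.pi**-0.25 * math.exp(-0.5 * x * x) * cmath.exp(1j * k0 * x)

dx = dk = 0.05
xs = [-10.0 + i * dx for i in range(401)]
ks = [-8.0 + j * dk for j in range(321)]
amplitudes = [momentum_amplitude(f, k, xs, dx) for k in ks]

norm_k = sum(abs(A)**2 for A in amplitudes) * dk     # left side of Equation 6.36
rebuilt = wave_packet(amplitudes, ks, dk, 0.5, 0.0)  # should reproduce f(0.5)
print(norm_k, abs(rebuilt - f(0.5)))
```

Calling `wave_packet` with $t > 0$ then gives the wave function at later times. In practice a fast Fourier transform routine would normally be used instead of these explicit sums, which are written out here only to mirror the equations.
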
Exercise 6.11 Use Equations 6.37 and 6.38 to suggest why it was sensible
to include a factor $1/\sqrt{2\pi}$ in our definition of the momentum eigenfunction
(Equation 6.33).


6.4.5 Wave-packet spreading 


As an example of the method outlined above, let’s consider a free particle whose 
initial state is described by a normalized Gaussian function 


W(x, 0) = _ (LY? pp. 
T, ) — f(z) ~~ yra e , (6.40) 


sketched in Figure 6.7a. The significance of the constant a is that it gives a typical 
width for the initial wave packet (in fact, Az = a/ V2). 


The initial wave packet is real-valued, so the initial value of (p;,) is equal to zero. 
Because a free particle experiences no forces, Ehrenfest’s theorem tells us that 


p(B) 


so the expectation value of momentum remains permanently equal to zero for this 
wave packet. The wave packet cannot move bodily through space, but it can 
expand. This important phenomenon is called wave-packet spreading. 


To predict how the wave packet spreads, we first calculate A(k), using 
Equation 6.38 to take the Fourier transform of f(x). The integration is quite 
tricky, so we shall just quote the answer: 


A(k) = A e7k?/2, 


This function is sketched in Figure 6.7b. It is interesting to note that A(k) is also a 
Gaussian function, but of typical width 1/a. So there is a reciprocal relation 
between the widths of f(x) and A(k) as can be seen in Figures 6.7c and d, where 
we show an alternative U(a,0) with a smaller Ax and with a correspondingly 
more spread-out A(k). The narrower the spread of the initial wave packet f(x), 
the broader the spread of the momentum amplitude function A(k), and vice versa. 
This is nothing other than the Heisenberg uncertainty principle in action. 
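

The reciprocal relation between the two widths can be checked numerically. The following is a minimal sketch, not part of the course software: it approximates the Fourier transform of the Gaussian $f(x)$ by the trapezoidal rule and compares the result with the closed form $(a/\sqrt{\pi})^{1/2} e^{-a^2k^2/2}$. The function names, the integration range and the number of steps are illustrative assumptions.

```python
import cmath
import math


def gaussian(x, a):
    """Normalized Gaussian f(x) with width parameter a."""
    return (1.0 / (math.sqrt(math.pi) * a)) ** 0.5 * math.exp(-x**2 / (2 * a**2))


def momentum_amplitude(k, a, half_range=12.0, n=4000):
    """Approximate A(k) = (1/sqrt(2 pi)) * integral of f(x) exp(-i k x) dx.

    Uses the trapezoidal rule on [-half_range * a, +half_range * a];
    the range and step count are assumptions chosen for this sketch.
    """
    lo, hi = -half_range * a, half_range * a
    dx = (hi - lo) / n
    total = 0j
    for j in range(n + 1):
        x = lo + j * dx
        weight = 0.5 if j in (0, n) else 1.0
        total += weight * gaussian(x, a) * cmath.exp(-1j * k * x)
    return total * dx / math.sqrt(2 * math.pi)


def analytic_amplitude(k, a):
    """Closed-form Gaussian momentum amplitude of typical width 1/a."""
    return (a / math.sqrt(math.pi)) ** 0.5 * math.exp(-((a * k) ** 2) / 2)


# Halving a makes A(k) fall off more slowly with k (a broader momentum spread).
for a in (1.0, 0.5):
    for k in (0.0, 1.0, 2.0):
        numeric = momentum_amplitude(k, a).real
        print(f"a={a}  k={k}  numeric={numeric:.6f}  analytic={analytic_amplitude(k, a):.6f}")
```

The numerical and analytic columns should agree closely, and the values for the narrower packet decay more slowly with $k$.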


Our initial wave function (Equation 6.40) has a typical width of order a. To see 
how the width of the wave packet grows with time, we put the function A(k) into 
Equation 6.39 to find $\Psi(x, t)$, the wave function of the free particle at any given
time $t$. The integrals are again tricky, so we shall just illustrate the final result.
Figure 6.8 shows the wave packet at three different times after $t = 0$; the steady
spreading of the wave packet is evident. 


Figure 6.7 (a) A normalized Gaussian function f(x), representing the initial 
state of a free particle. (b) The corresponding momentum amplitude function 
A(k), found by taking the Fourier transform of f(x). (c) Another Gaussian with a 
smaller $\Delta x$. (d) The momentum amplitude function A(k) corresponding to (c).


Figure 6.8 A free-particle wave packet shown at $t = 0$ and at three later times,
with $0 < t_1 < t_2 < t_3$. The initial spread of the wave packet has been chosen to
be comparable to the size of an atom. The real part of the wave function is shown
in blue, the imaginary part in green and the probability density $|\Psi|^2$ in black.


Although the wave function develops real and imaginary parts, both of which
have lots of wiggles, the modulus of the wave function turns out to be yet another
Gaussian function


$$|\Psi(x, t)| = \left(\frac{1}{\sqrt{\pi}\,b}\right)^{1/2} e^{-x^2/2b^2}$$


with a width


$$b = \left(a^2 + \frac{\hbar^2 t^2}{m^2 a^2}\right)^{1/2} \qquad (6.41)$$


that increases with time. The corresponding uncertainty in position is
$\Delta x = b/\sqrt{2}$. In the limit of large $t$, when the wave packet has spread far beyond
its initial width $a$, we can make the approximation


$$\Delta x = \frac{b}{\sqrt{2}} \simeq \frac{\hbar}{\sqrt{2}\,a m}\, t, \qquad (6.42)$$


so the width of the wave packet eventually becomes proportional to time.
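

Since the integrals are only quoted in the text, it may help to see the spreading emerge from a direct numerical evaluation. The sketch below builds $\Psi(x, t)$ as a superposition of de Broglie waves weighted by the Gaussian momentum amplitude, then compares the resulting $\Delta x$ with $b/\sqrt{2}$. It works in assumed units with $\hbar = m = a = 1$; the grid sizes, cut-offs and function names are illustrative choices, not taken from the course software.

```python
import cmath
import math

HBAR = 1.0     # assumed natural units for this sketch
MASS = 1.0
A_WIDTH = 1.0  # initial width parameter a


def amplitude(k, a=A_WIDTH):
    """Gaussian momentum amplitude A(k) of typical width 1/a."""
    return (a / math.sqrt(math.pi)) ** 0.5 * math.exp(-((a * k) ** 2) / 2)


def wave_function(x, t, a=A_WIDTH, k_max=8.0, n=400):
    """Psi(x, t) = (1/sqrt(2 pi)) * integral of A(k) exp(i(kx - hbar k^2 t / 2m)) dk.

    Evaluated by the trapezoidal rule on [-k_max, k_max].
    """
    dk = 2 * k_max / n
    total = 0j
    for j in range(n + 1):
        k = -k_max + j * dk
        weight = 0.5 if j in (0, n) else 1.0
        phase = k * x - HBAR * k**2 * t / (2 * MASS)
        total += weight * amplitude(k, a) * cmath.exp(1j * phase)
    return total * dk / math.sqrt(2 * math.pi)


def position_uncertainty(t, x_max=30.0, n=150):
    """Delta x computed from the numerical probability density |Psi|^2."""
    dx = 2 * x_max / n
    norm = mean = mean_sq = 0.0
    for j in range(n + 1):
        x = -x_max + j * dx
        rho = abs(wave_function(x, t)) ** 2
        norm += rho * dx
        mean += x * rho * dx
        mean_sq += x * x * rho * dx
    mean /= norm
    return math.sqrt(mean_sq / norm - mean**2)


def width_b(t, a=A_WIDTH):
    """Width b(t) = sqrt(a^2 + hbar^2 t^2 / (m^2 a^2))."""
    return math.sqrt(a**2 + (HBAR * t / (MASS * a)) ** 2)


for t in (0.0, 2.0, 4.0):
    print(f"t={t}  numerical={position_uncertainty(t):.4f}  b/sqrt(2)={width_b(t) / math.sqrt(2):.4f}")
```

The two columns should agree closely at each time, with both growing roughly linearly once $t$ is large.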


There is nothing mysterious about wave-packet spreading; it merely reflects the 
initial uncertainty in the velocity of the particle. The initial wave function 
corresponds to a spread of various possible momenta, with a momentum
distribution determined by $|A(k)|^2$, and this momentum distribution remains
constant for a free-particle wave packet because there are no forces to change it.
Corresponding to the spread in possible momenta, there is a spread in possible
velocities, $\Delta v_x = \Delta p_x/m$. This spread in velocities gives rise to an uncertainty in
the position of the particle that increases with time. 


Ignoring the initial width of the wave packet, the uncertainty in position can be 
estimated to be 


$$\Delta x \simeq \Delta v_x\, t = \frac{\Delta p_x}{m}\, t. \qquad (6.43)$$


The wave packet in Equation 6.40 is Gaussian in shape, just like the ground state
of a harmonic oscillator. Although we are not dealing with a harmonic oscillator
here, the fact that our wave function has the same shape as the harmonic-oscillator
ground state allows us to use the result


$$\Delta p_x = \frac{\hbar}{\sqrt{2}\,a} \qquad \text{(Eqn 5.51)}$$


obtained near the end of Chapter 5. Combining Equations 6.43 and 5.51, we
then recover Equation 6.42.


It is worth noting that decreasing the spatial extent of the initial wave packet 
increases the spread of momenta and therefore increases the rate at which the 
wave packet spreads. If you want to construct a wave packet that remains compact 
for a long time, you should not start out with one that is too narrow. 
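

To put numbers to this trade-off, the following sketch evaluates the width formula of Equation 6.41 for a free electron. The physical constants are standard values; the helper names and the two initial uncertainties (1 mm and 0.1 mm, observed after 5 hours) are illustrative assumptions.

```python
import math

HBAR = 1.054571817e-34            # reduced Planck constant, J s
ELECTRON_MASS = 9.1093837015e-31  # electron mass, kg


def width_b(a, t, m):
    """Width b(t) = sqrt(a^2 + hbar^2 t^2 / (m^2 a^2)) from Equation 6.41."""
    return math.sqrt(a**2 + (HBAR * t / (m * a)) ** 2)


def delta_x(a, t, m):
    """Position uncertainty Delta x = b / sqrt(2)."""
    return width_b(a, t, m) / math.sqrt(2)


def a_from_delta_x(initial_delta_x):
    """Invert Delta x = a / sqrt(2) at t = 0 to obtain the width parameter a."""
    return math.sqrt(2) * initial_delta_x


five_hours = 5 * 3600.0  # seconds
for initial in (1e-3, 1e-4):  # initial Delta x in metres (illustrative values)
    a = a_from_delta_x(initial)
    later = delta_x(a, five_hours, ELECTRON_MASS)
    print(f"initial Delta x = {initial:.0e} m  ->  after 5 h: {later:.3e} m")
```

With these inputs the 1 mm packet grows to roughly a kilometre, while the packet that starts ten times narrower ends up roughly ten times wider.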


Exercise 6.12 A free electron is described at $t = 0$ by a Gaussian wave packet
with $\Delta x = 1$ mm. Show that 5 hours later $\Delta x \approx 1$ km.


6.5 Beyond free particles


You have seen that the function $|A(k)|^2$ tells us about the distribution of
momentum in a free-particle wave packet. But free particles are a very special 
case. It also makes sense to talk about the distribution of momentum in other 
types of wave packet — a harmonic-oscillator wave packet, for example. So far, 
we have not said how such a momentum distribution can be calculated. For 
completeness, we now tie up this loose end. 


A free particle has an unchanging distribution of momentum, but this is not true 
for a particle subject to external forces. You saw earlier that the expectation 
value of momentum varies sinusoidally in a harmonic-oscillator wave packet. 
Nevertheless, we can extend Born’s rule for momentum to cover all particles, 
provided we use a time-dependent momentum amplitude $A(k, t)$. In general, the
probability of finding the momentum at time $t$ to lie in a small range $\hbar\,\delta k$, centred
on $\hbar k$, is $|A(k, t)|^2\,\delta k$.


The only remaining issue is specifying an appropriate function $|A(k, t)|^2$. For free
particles, we know that this is done by taking the Fourier transform of the initial
wave function. Equation 6.38 shows that


$$|A(k)|^2 = \left| \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \Psi(x, 0)\, e^{-ikx}\, dx \right|^2 \quad \text{for a free particle.}$$


This equation generalizes to non-free particles, simply by using the wave function
at the appropriate time:


$$|A(k, t)|^2 = \left| \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \Psi(x, t)\, e^{-ikx}\, dx \right|^2. \qquad (6.44)$$


In general, this leads to a time-dependent distribution of momentum, reflecting the 
fact that external forces cause the momentum distribution to change. 


Figure 6.9 shows an example of a momentum distribution calculated by this 
method. It shows $|A(k, t)|^2$ for the harmonic-oscillator wave packet of Figure 6.2
at t = T/8. At the instant shown, a measurement of momentum would give a 
negative value with overwhelming probability, corresponding to the classical 
picture of a particle moving back towards the origin. Nevertheless, there is a small 
but non-vanishing probability of finding a positive momentum, corresponding to a 
particle that is moving away from the origin. Not for the first (or last) time, 
quantum mechanics confounds our classical intuition. 


Figure 6.9 A graph of $|A(k, t)|^2$ against $k$ for the harmonic-oscillator wave
packet of Figure 6.2 at time $t = T/8$.


Computer simulations: free-particle wave packets


Now is a good time to study the software package Free-particle wave 
packets. Here is a brief summary of some things you can learn from this 
package: 


• Free-particle wave packets with $\langle p_x \rangle = 0$ spread at a rate inversely
related to their initial width.


• Free-particle wave packets with $\langle p_x \rangle \neq 0$ spread in a similar way, but
with a steadily moving ‘centre of mass’.


• The process of single-slit diffraction of an electron wave can be
analyzed, starting with a ‘top-hat’ initial wave function representing an
electron that has just passed through the slit. The subsequent spreading
of this wave packet produces a probability distribution for finding the
electron at any given point on the screen.


• Moving beyond free-particle wave packets, we can consider a
wave packet that encounters a potential energy ramp. The momentum
distribution then changes with time in response to the external force.


Summary of Chapter 6 


Section 6.1 A wave packet is a normalized linear combination of two or more 
stationary-state wave functions. Because of the superposition principle, wave 
packets satisfy Schrödinger’s equation. Unlike stationary states, wave packets
have probability densities that change in time, and so can describe motion. 


In a harmonic oscillator, the probability density associated with a wave packet can
oscillate to-and-fro with the classical frequency. Other types of motion, such as
symmetrical breathing in and out, are also possible. The expectation values
of position and momentum are either permanently equal to zero, or oscillate
sinusoidally with time.


Section 6.2 Ehrenfest’s theorem states that 


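$$\frac{d\langle x \rangle}{dt} = \frac{\langle p_x \rangle}{m} \quad \text{and} \quad \frac{d\langle p_x \rangle}{dt} = -\left\langle \frac{\partial V}{\partial x} \right\rangle.$$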


This theorem is always true. The classical limit corresponds to a wave packet with 
narrow spreads of position and momentum, subject to a force that varies slowly 
over the width of the wave packet. In this limit, the predictions of quantum 
mechanics approach those of classical mechanics. 


Section 6.3 Given an initial wave function $\Psi(x, 0)$, it is possible to predict
the time-development of this wave function using a three-step procedure. We
illustrated this for a harmonic oscillator. First, we expanded the initial state as a
sum of energy eigenfunctions $\psi_i(x)$ of the oscillator. This is always possible
because these eigenfunctions form a complete set. Then we determined the
coefficients in the expansion, using the overlap integral


$$c_i = \int_{-\infty}^{\infty} \psi_i^*(x)\, \Psi(x, 0)\, dx.$$


Finally, we introduced time-dependent phase factors into the sum, replacing each
energy eigenstate $\psi_i(x)$ by the corresponding stationary-state wave function
$\psi_i(x)\, e^{-iE_i t/\hbar}$. This provides a solution of Schrödinger’s equation that obeys the
initial conditions, and so is the wave function we were looking for.


Section 6.4 Free-particle wave packets can be constructed by taking a 
continuous linear combination of de Broglie wave functions: 


$$\Psi(x, t) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} A(k)\, e^{i(kx - E_k t/\hbar)}\, dk.$$


This wave packet is normalized provided that the function $A(k)$ obeys the
condition $\int_{-\infty}^{\infty} |A(k)|^2\, dk = 1$. The coefficients $A(k)$ are called momentum
amplitudes. They are interpreted with the aid of Born’s rule for momentum,
which states that the probability of obtaining a momentum in a small interval of
momentum $\hbar\,\delta k$, centred on $\hbar k$, is $|A(k)|^2\, \delta k$.


The time development of a free-particle wave packet can be predicted using an 
extension of the three-step procedure used for a harmonic oscillator. For a free 
particle, the overlap integral becomes a Fourier transform: 


$$A(k) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \Psi(x, 0)\, e^{-ikx}\, dx.$$


By analyzing the motion of a free-particle wave packet, we predicted how 
a Gaussian packet spreads with time, and (in a computer simulation) how a 
single-slit diffraction pattern is formed. 


Achievements from Chapter 6 


After studying this chapter, you should be able to: 
6.1 Explain the meanings of the newly-defined (emboldened) terms and 
symbols, and use them appropriately. 


6.2 Give a general account of harmonic-oscillator wave packets and their role in 
accounting for motion in a harmonic oscillator. 


6.3 Calculate the time-dependences of $\langle x \rangle$ and $\langle p_x \rangle$ for given
harmonic-oscillator wave packets. 


6.4 State Ehrenfest’s theorem and give an account of its significance in relating 
quantum mechanics to classical mechanics. 


6.5 Describe the procedure used to predict the time-development of a 
harmonic-oscillator wave function, given its value at time t = 0. 


6.6 Describe how a free-particle wave packet is constructed, including the 
condition needed to ensure normalization. 


6.7 Interpret momentum amplitudes and use them in Born’s rule for momentum.


6.8 Give an account of the spreading of free-particle wave packets.


After studying the computer simulation packages, you should also be able to:


6.9 Compare and contrast the behaviour of wave packets in a harmonic potential 
energy well and an infinite square potential energy well. 


6.10 Recognize that momentum amplitudes become time-dependent in wave
packets that describe particles subject to forces.


Introduction


You have now met two approaches to describing the state of a system in wave
mechanics. In cases where the probability distributions are independent of time a
stationary-state approach can be used. In other cases, where probabilities are
time-dependent and motion is really taking place, a wave-packet approach can be
used. The two approaches are related but different. In many situations the choice
of approach is obvious and straightforward, but that is not always the case, as you
will soon see.


In this chapter we shall consider two physical phenomena of fundamental
importance: scattering and tunnelling. Each will be treated using both a
stationary-state approach and a wave-packet approach.


The phenomenon of scattering was an important topic in physics long before the
development of wave mechanics. In its most general sense, scattering is a process
in which incident particles (or waves) are affected by interaction with some kind
of target, quite possibly another particle (Figure 7.1). The interaction can affect an
incident particle in a number of ways; it may change its speed, direction of
motion or state of internal excitation. Particles can even be created, destroyed or
absorbed.


Figure 7.1 The phenomenon of scattering.


It can be argued that almost everything we know about the world is learnt as a
result of scattering. When we look at a non-luminous object such as a book or a
building we see it because of the light that is scattered from its surface. The sky 
is blue because the particles in the Earth’s atmosphere are more effective at 
scattering blue light (relatively short wavelengths) than yellow or red light (longer 
wavelengths). This is also the reason why sunsets are red (Figure 7.2). As the Sun 
approaches the horizon, its light has to traverse a lengthening path through the 
Earth’s atmosphere; as a consequence shorter wavelengths are increasingly 
scattered out of the beam until all that remains is red. 


Much of what we know about the structure of matter has been derived from 
scattering experiments. For example, the scattering of alpha particles from a gold 
foil, observed by Geiger and Marsden in 1909, led Rutherford to propose the first 
nuclear model of an atom. More recent scattering experiments, involving giant 
particle accelerators, have provided insight into the fundamental constituents of 
matter such as the quarks and gluons found inside protons and neutrons. Even 
our knowledge of cosmology, the study of the Universe as a whole, is deeply 
dependent on scattering. One of the main sources of precise cosmological 
information is the study of the surface of last scattering observed all around us at 
microwave wavelengths (Figure 7.3). 


Figure 7.2 Red sunsets are a direct consequence of the scattering of sunlight.


Figure 7.3 Microwave image of the surface of last scattering.


In our detailed discussions of scattering we shall not consider cases where the 
scattering changes the number or nature of the scattered particles, since that 
requires the use of quantum field theory, a part of quantum physics beyond the 
scope of this book. Rather, we shall mainly restrict ourselves to one-dimensional 
problems in which an incident beam or particle is either transmitted (allowed to 
pass) or reflected (sent back the way it came) as a result of scattering from a 
target. Moreover, that target will generally be represented by a fixed potential 
energy function, typically a finite well or a finite barrier of the kind indicated in 
Figure 7.4. Despite these restrictions, our discussion of quantum-mechanical 
scattering will contain many surprises. For instance, you will see that a finite
potential energy barrier of height $V_0$ can reflect a particle of energy $E_0$, even
when $E_0 > V_0$. Perhaps even more amazingly, you will see that unbound particles
can be reflected when they encounter a finite well.


Figure 7.4 (a) Particles with energy $E_0 > V_0$, encountering a finite barrier of
height $V_0$, have some probability of being reflected. (b) Similarly, unbound
particles with energy $E_0 > 0$ can be reflected by a finite well.


The phenomenon of tunnelling is entirely quantum-mechanical with no analogue
in classical physics. It is an extension of the phenomenon of barrier penetration
that you met in earlier chapters. Barrier penetration involves the appearance of
particles in classically forbidden regions. In cases of tunnelling, such as that
shown in Figure 7.5, a particle with energy $E_0 < V_0$ can penetrate a potential
energy barrier of height $V_0$, pass through the classically forbidden region within
the barrier and have some finite probability of emerging into the classically
allowed region on the far side.


Figure 7.5 Particles with energy $E_0 < V_0$, encountering a finite barrier of
height $V_0$, have some probability of being transmitted by tunnelling through the
barrier. Such a process is forbidden in classical physics.


Tunnelling phenomena are common in many areas of physics. In this chapter you 
will see how tunnelling provides an explanation of the alpha decay of radioactive 
nuclei, and is also an essential part of the nuclear fusion processes by which stars 
produce light. Finally you will see how quantum tunnelling has allowed the 
development of instruments called scanning tunnelling microscopes (STMs) that 
permit the positions of individual atoms on a surface to be mapped in stunning 
detail. 


7.1 Scattering: a wave-packet approach 


This section discusses the scattering of a particle using wave packets of the type 
discussed in Chapter 6. We shall restrict attention to one dimension and suppose 
that the incident particle is initially free, described by a wave packet of the form 


$$\Psi(x, t) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} A(k)\, e^{i(kx - E_k t/\hbar)}\, dk. \qquad (7.1)$$


This is a superposition of de Broglie waves, with the function $e^{i(kx - E_k t/\hbar)}$
corresponding to momentum $p_x = \hbar k$ and energy $E_k = \hbar^2 k^2/2m$. The
momentum amplitude function $A(k)$ determines the blend of de Broglie waves in
the initial free wave packet, but when the wave packet encounters a change in the
potential energy function, the blend of de Broglie waves changes, and scattering
takes place.


7.1.1 Wave packets and scattering in one dimension 


Figure 7.6 shows the scattering of a wave packet, incident from the left, on a 
target represented by a potential energy function of the form 


$$V(x) = V_0 \quad \text{for } 0 \le x \le L, \qquad (7.2)$$

$$V(x) = 0 \quad \text{for } x < 0 \text{ and } x > L. \qquad (7.3)$$


Potential energy functions of this type are called finite square barriers. They 
are simple idealizations of the more general kind of finite barrier shown in 
Figure 7.4a. The de Broglie waves that make up the wave packet extend over a 
range of energy and momentum values. In the case illustrated in Figure 7.6, the
expectation value of the energy, $\langle E \rangle$, has a value $E_0$ that is greater than the
height of the potential energy barrier $V_0$. The classical analogue of the process
illustrated in Figure 7.6 would be a particle of energy $E_0$ scattering from a
repulsive target. The unrealistically steep sides of the potential energy function
imply that the encounter is sudden and impulsive, unlike the encounter
between two negatively charged particles, for instance; there is no gradual slope
for the incident particle to ‘climb’, nor for it to descend after the interaction. Still,
in the classical case, the fact that $E_0$ is greater than $V_0$ implies that the incident
particle has enough energy to overcome the resistance offered by the target, and it
would be certain to emerge on the far side of it.


The quantum analysis tells a different story. Based on numerical solutions of 
Schrödinger’s equation, the computer-generated results in Figure 7.6 show
successive snapshots of $|\Psi|^2$, each of which represents the probability density of
the particle at a particular instant. Examining the sequence of pictures it is easy to 
visualize the probability as a sort of fluid that flows from one part of the x-axis to 
another. Initially, as the wave packet approaches the barrier, the probability is 
concentrated in a single ‘blob’, flowing from left to right. Then, as the wave 
packet encounters the barrier, something odd starts to happen; the probability 
distribution develops closely-spaced peaks. These peaks are a consequence of 
reflection — part of the quantum wave packet passes through the barrier, but 
another part is reflected; the reflected part interferes with the part still advancing 
from the left and results in the spiky graph of $|\Psi|^2$. Eventually, however, the
interference subsides and what remains are two distinct ‘blobs’ of probability 
density: one returning to the left, the other progressing to the right. 


It’s important to realize that the splitting of the wave packet illustrated
in Figure 7.6 does not represent a splitting of the particle described by the
wave packet. The normalization of the wave packet is preserved throughout the
scattering process (the area under each of the graphs in Figure 7.6 is equal to 1); 
there is only ever one particle being scattered. The splitting of the wave packet 
simply indicates that, following the scattering, there are two distinct regions in 
which the particle might be found. In contrast to the certainty of transmission in 
the classical case, the quantum calculation predicts some probability of 
transmission, but also some probability of reflection. Indeterminacy is, of course,
a characteristic feature of quantum mechanics.


Figure 7.6 The scattering of a wave packet with $\langle E \rangle = E_0$ by a finite square
barrier of height $V_0$ when $E_0 > V_0$. The probability density $|\Psi|^2$ is shown in a
sequence of snapshots with time increasing from top to bottom. The barrier has
been made narrow in this example but a greater width could have been chosen.


Figure 7.7 The scattering of a wave packet with $\langle E \rangle = E_0$ by a finite square
well of depth $V_0$ when $E_0 > 0$. The probability density $|\Psi|^2$ is shown in a
sequence of snapshots with time increasing from top to bottom. The well has
been made narrow in this example but a greater width could have been chosen.


Exercise 7.1 Simply judging by eye, what are the respective probabilities of
reflection and transmission as the final outcome of the scattering process shown in
Figure 7.6?


The probability that a given incident particle is reflected is called the reflection 
coefficient, R, while the probability that it is transmitted is called the 
transmission coefficient, T. Since one or other of these outcomes must occur, we 
always have 


$$R + T = 1. \qquad (7.4)$$


The values of R and T that apply in any particular case of one-dimensional 
scattering can be worked out by examining the solution of the relevant 
Schrödinger equation that applies at a time long after the incident particle has 
encountered the target. For properly normalized wave packets, the values of R 
and T are found by measuring the areas under the graph of $|\Psi|^2$ that are located to
the left and to the right of the target at that time. 
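

The area-measuring step can be sketched in a few lines. The example below does not solve Schrödinger’s equation; it assumes a made-up final probability density consisting of two well-separated Gaussian blobs (weights 0.3 and 0.7) on either side of a barrier occupying $0 \le x \le L$, and simply integrates each side. All names and numbers are hypothetical and chosen only for illustration.

```python
import math


def trapezoid(f, lo, hi, n=4000):
    """Trapezoidal estimate of the integral of f from lo to hi."""
    dx = (hi - lo) / n
    total = 0.5 * (f(lo) + f(hi))
    for j in range(1, n):
        total += f(lo + j * dx)
    return total * dx


def blob(x, centre, width):
    """Gaussian of unit area centred on `centre`."""
    return math.exp(-(((x - centre) / width) ** 2)) / (math.sqrt(math.pi) * width)


def final_density(x):
    """Hypothetical |Psi|^2 long after scattering: a reflected and a transmitted blob."""
    return 0.3 * blob(x, -40.0, 4.0) + 0.7 * blob(x, 45.0, 4.0)


def reflection_and_transmission(density, barrier_left, barrier_right, x_min, x_max):
    """R is the area to the left of the target, T the area to its right."""
    reflected = trapezoid(density, x_min, barrier_left)
    transmitted = trapezoid(density, barrier_right, x_max)
    return reflected, transmitted


R, T = reflection_and_transmission(final_density, 0.0, 1.0, -100.0, 100.0)
print(f"R = {R:.4f}, T = {T:.4f}, R + T = {R + T:.4f}")
```

Because the assumed density is normalized and vanishes near the barrier, the two areas add up to 1, as Equation 7.4 requires.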


Reflection and transmission coefficients can also describe scattering from an 
attractive target. The idealized case of a wave packet with $\langle E \rangle = E_0 > 0$
encountering a finite square well of depth $V_0$ and width $L$ is shown in Figure 7.7.
Again, the wave packet is incident from the left, and the wave packet is partly 
reflected, giving rise to interference effects. Eventually, however, the interference 
abates, leaving two distinct parts of the wave packet. The areas of these two parts 
determine the reflection and transmission coefficients. 


Of course, wave mechanics is probabilistic, so although reflection and 
transmission coefficients can be calculated in any given situation, it is never 
possible to predict what will happen to any individual particle (unless R or T 
happen to be equal to zero in the situation considered). Indeed, the detection of 
the scattered particle, after the scattering has taken place, is exactly the kind of 
measurement that brings about the collapse of the wave function — a sudden, 
abrupt and unpredictable change that is not described by Schrödinger’s equation.
After such a collapse has occurred, the particle is localized on one side of the 
target or the other. 


7.1.2 Simulations using wave packets 


Now would be a good time to study the computer package The scattering of wave 
packets in one dimension that was used to generate the probability density plots 
shown in Figures 7.6 and 7.7. This package will allow you to specify a range of 
barriers and wells (not necessarily square), and to observe the way in which they 
scatter Gaussian wave packets (wave packets for which $|\Psi|^2$ is shaped like a
Gaussian function). 


The simulation is powerful and will reward detailed investigation. However, you 
should not spend too much time on it at this stage in your study of the chapter; 
thirty minutes should be sufficient for a first look. As you work through the 
package you should use its features, such as the ability to freeze the motion, or to 
advance the simulation step-by-step, to observe the following points: 


• The spreading of the wave packet before, and after, it encounters the barrier or
well (a consequence of the range of momentum values that contribute to the
wave packet).


• The reflection of the wave packet by different parts of the barrier or well, and
the way this leads to interference effects within the wave packet.


• The differences in behaviour between the real and imaginary parts of the wave
packet and the way they combine to produce the probability density $|\Psi|^2$.


• The way in which the transmission and reflection coefficients depend on the
energy of the incident wave packet and the parameters that define the barrier or
well.


7.2 Scattering: a stationary-state approach 


Scattering calculations using wave packets are so laborious that they are generally 
done numerically, using a computer. However, in many cases, scattering 
phenomena can be adequately treated using a procedure based on stationary 
states. This approach can give valuable insight into the scattering process without 
the need for computer simulations. 


This section introduces the stationary-state approach to scattering. The 
discussion is mainly confined to one dimension, so a stationary-state solution to 
Schrödinger’s equation can be written in the form $\Psi(x, t) = \psi(x)\,e^{-iEt/\hbar}$, where
$\psi(x)$ satisfies the appropriate time-independent Schrödinger equation at energy $E$.
The first challenge is to find a way of interpreting stationary-state solutions that 
makes them relevant to an inherently time-dependent phenomenon like scattering. 


7.2.1 Stationary states and scattering in one dimension 


The key idea of the stationary-state approach is to avoid treating individual 
particles, and to consider instead the scattering of a steady intense beam of 
particles, each particle having the same energy $E_0$. It is not possible to predict the
exact behaviour of any individual particle but, if the incident beam is sufficiently
intense, the result of the scattering will be reflected and transmitted beams
with steady intensities that are determined by the reflection and transmission
coefficients we are aiming to evaluate. Provided we consider the beams as a 
whole, nothing in this arrangement depends on time. A snapshot of the set-up 
taken at one time would be identical to a similar snapshot taken at another time. 
In contrast to the wave-packet approach, there are no moving ‘blobs of probability 
density’, so the whole process can be described in terms of stationary states. 


For a one-dimensional beam, we define the beam intensity $j$ to be the number of
beam particles that pass a given point per unit time. We also define the linear
number density $n$ of the beam to be the number of beam particles per unit length.
Then, thinking in classical terms for a moment, if all the particles in a beam have
the same speed $v$, the beam intensity is given by $j = vn$. Specializing this
relationship to the incident, reflected and transmitted beams, we have


$$j_{\text{inc}} = v_{\text{inc}}\, n_{\text{inc}}, \quad j_{\text{ref}} = v_{\text{ref}}\, n_{\text{ref}}, \quad j_{\text{trans}} = v_{\text{trans}}\, n_{\text{trans}}.$$


In the stationary-state approach, the reflection and transmission coefficients can
be expressed in terms of beam intensity ratios, as follows:


$$R = \frac{j_{\text{ref}}}{j_{\text{inc}}} \quad \text{and} \quad T = \frac{j_{\text{trans}}}{j_{\text{inc}}}. \qquad (7.5)$$


If all the incident particles are scattered, and no particles are created or destroyed,
it must be the case that $j_{\text{inc}} = j_{\text{ref}} + j_{\text{trans}}$. Dividing both sides by $j_{\text{inc}}$ and
rearranging gives $R + T = 1$, as expected from our earlier discussions.


We now need to relate these steady beam intensities to stationary-state solutions 
of the relevant Schrödinger equation. This requires some care, since we are 
familiar with the use of Schrödinger’s equation to describe individual particles but 
not beams of particles. To make the steps in the analysis as clear as possible,
we shall begin by considering a particularly simple kind of one-dimensional
scattering target. 


7.2.2 Scattering from a finite square step 


The kind of one-dimensional scattering target we shall be concerned with in this 
subsection is called a finite square step. It can be represented by the potential 
energy function 


$$V(x) = 0 \quad \text{for } x < 0, \qquad (7.6)$$

$$V(x) = V_0 \quad \text{for } x > 0. \qquad (7.7)$$


The finite square step (Figure 7.8) provides a simplified model of the potential 
energy function that confronts an electron as it crosses the interface between two 
homogeneous media. The discontinuous change in the potential energy at x = 0 
is, of course, unrealistic, but this is the feature that makes the finite square step 
simple to treat mathematically. The fact that we are dealing with a square step 
means that we shall only have to consider two regions of the $x$-axis: Region 1
where x < 0, and Region 2 where x > 0. 


Figure 7.8 A finite square step of height $V_0 < E_0$.


Classically, when a finite square step of height $V_0$ scatters a rightward moving
beam in which each particle has energy $E_0 > V_0$, each of the particles will
continue moving to the right but will be suddenly slowed as it passes the point
$x = 0$. The transmitted particles are slowed because, in the region $x > 0$, each
particle has an increased potential energy, and hence a reduced kinetic energy.
The intensity of each beam is the product of the linear number density and the
speed of the particles in that beam. To avoid any accumulation of particles at the
step, the incident and transmitted beams must have equal intensities; the slowing
of the transmitted beam therefore implies that it has a greater linear number
density than the incident beam.
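

The classical bookkeeping in this paragraph can be made concrete with a short calculation. The sketch below assumes non-relativistic particles whose kinetic energy is $E_0$ before the step and $E_0 - V_0$ after it, and imposes equal incident and transmitted intensities. The function name and the sample numbers (arbitrary units with $m = 1$) are illustrative assumptions.

```python
import math


def classical_step(E0, V0, m, n_inc):
    """Classical beam crossing a step of height V0 with energy E0 > V0.

    Returns (v_inc, v_trans, n_trans), using kinetic energy = m v^2 / 2
    and equal intensities j = v * n on both sides of the step.
    """
    if E0 <= V0:
        raise ValueError("this sketch only covers E0 > V0")
    v_inc = math.sqrt(2 * E0 / m)
    v_trans = math.sqrt(2 * (E0 - V0) / m)
    j_inc = v_inc * n_inc
    n_trans = j_inc / v_trans  # j_trans = j_inc, so the density must rise
    return v_inc, v_trans, n_trans


v_inc, v_trans, n_trans = classical_step(E0=4.0, V0=3.0, m=1.0, n_inc=1.0)
print(f"v_inc = {v_inc:.4f}, v_trans = {v_trans:.4f}, n_trans / n_inc = {n_trans:.4f}")
```

In this example the kinetic energy drops by a factor of 4, so the speed halves and the linear number density doubles.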


Exercise 7.2 In general terms, how would you expect the outcome of the 
quantum scattering process to differ from the classical outcome?


As usual, we start our analysis by writing down the relevant Schrödinger equation: 


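$$i\hbar \frac{\partial \Psi(x, t)}{\partial t} = -\frac{\hbar^2}{2m} \frac{\partial^2 \Psi(x, t)}{\partial x^2} + V(x)\, \Psi(x, t), \qquad (7.8)$$

where $V(x)$ is the finite square step potential energy function given in
Equations 7.6 and 7.7. We seek stationary-state solutions of the form
$\Psi(x, t) = \psi(x)\, e^{-iE_0 t/\hbar}$, where $E_0$ is the fixed energy of each beam particle. The
task of solving Equation 7.8 then reduces to that of solving the time-independent
Schrödinger equations


$$-\frac{\hbar^2}{2m} \frac{d^2\psi}{dx^2} = E_0\, \psi(x) \quad \text{for } x < 0, \qquad (7.9)$$

$$-\frac{\hbar^2}{2m} \frac{d^2\psi}{dx^2} + V_0\, \psi(x) = E_0\, \psi(x) \quad \text{for } x > 0. \qquad (7.10)$$

A simple rearrangement gives

$$\frac{d^2\psi}{dx^2} + \frac{2mE_0}{\hbar^2}\, \psi(x) = 0 \quad \text{for } x < 0,$$

$$\frac{d^2\psi}{dx^2} + \frac{2m(E_0 - V_0)}{\hbar^2}\, \psi(x) = 0 \quad \text{for } x > 0,$$

and it is easy to see that these equations have the general solutions

$$\psi(x) = A e^{ik_1 x} + B e^{-ik_1 x} \quad \text{for } x < 0 \text{ (Region 1)}, \qquad (7.11)$$

$$\psi(x) = C e^{ik_2 x} + D e^{-ik_2 x} \quad \text{for } x > 0 \text{ (Region 2)}, \qquad (7.12)$$


where $A$, $B$, $C$ and $D$ are arbitrary complex constants, and the wave numbers in
Region 1 and Region 2 are respectively


$$k_1 = \frac{\sqrt{2mE_0}}{\hbar} \quad \text{and} \quad k_2 = \frac{\sqrt{2m(E_0 - V_0)}}{\hbar}. \qquad (7.13)$$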
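

To get a feel for the sizes involved, the wave numbers in the two regions can be evaluated for a concrete case. The sketch below assumes an electron beam with $E_0 = 5$ eV meeting a step of height $V_0 = 3$ eV; these values, and the function name, are illustrative choices rather than numbers taken from the text.

```python
import math

HBAR = 1.054571817e-34            # reduced Planck constant, J s
ELECTRON_MASS = 9.1093837015e-31  # electron mass, kg
EV = 1.602176634e-19              # one electronvolt in joules


def wave_numbers(E0, V0, m):
    """Return (k1, k2) for a square step with E0 > V0 (energies in joules).

    k1 = sqrt(2 m E0) / hbar applies in Region 1 (x < 0) and
    k2 = sqrt(2 m (E0 - V0)) / hbar applies in Region 2 (x > 0).
    """
    if E0 <= V0:
        raise ValueError("this sketch only covers E0 > V0")
    k1 = math.sqrt(2 * m * E0) / HBAR
    k2 = math.sqrt(2 * m * (E0 - V0)) / HBAR
    return k1, k2


k1, k2 = wave_numbers(5.0 * EV, 3.0 * EV, ELECTRON_MASS)
print(f"k1 = {k1:.3e} m^-1, wavelength = {2 * math.pi / k1:.3e} m")
print(f"k2 = {k2:.3e} m^-1, wavelength = {2 * math.pi / k2:.3e} m")
```

The wave number is smaller in Region 2, so the de Broglie wavelength is longer there, in line with the reduced kinetic energy beyond the step.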

You may wonder why we have expressed these solutions in terms of complex
exponentials rather than sines and cosines. The reason is that the individual terms
in Equations 7.11 and 7.12 have simple interpretations in terms of the incident,
reflected and transmitted beams. To see how this works, it is helpful to note that


$$\hat{p}_x\, e^{\pm ikx} = -i\hbar \frac{\partial}{\partial x} e^{\pm ikx} = \pm\hbar k\, e^{\pm ikx}.$$

It therefore follows that terms proportional to $e^{ikx}$ are associated with particles
moving rightward at speed $\hbar k/m$, while terms proportional to $e^{-ikx}$ are
associated with particles moving leftward at speed $\hbar k/m$.


These directions of motion can be confirmed by writing down the corresponding
stationary-state solutions, which take the form

$$ \Psi(x,t) = A\,e^{i(k_1 x - \omega t)} + B\,e^{-i(k_1 x + \omega t)} \quad \text{for } x < 0, \tag{7.14} $$

$$ \Psi(x,t) = C\,e^{i(k_2 x - \omega t)} + D\,e^{-i(k_2 x + \omega t)} \quad \text{for } x > 0, \tag{7.15} $$

where $\omega = E_0/\hbar$. We can then identify terms of the form $e^{i(kx - \omega t)}$ as plane waves
travelling in the positive $x$-direction, while terms of the form $e^{-i(kx + \omega t)}$ are
plane waves travelling in the negative $x$-direction. None of these waves can be
normalized, so they cannot describe individual particles, but you will see that they
can describe steady beams of particles.


In most applications of wave mechanics, the wave function $\Psi(x,t)$ describes the
state of a single particle, and $|\Psi(x,t)|^2$ represents the probability density for that
particle. In the steady-state approach to scattering, however, it is assumed that the
wave function $\Psi(x,t)$ describes steady beams of particles, with $|\Psi(x,t)|^2$
interpreted as the number of particles per unit length, that is, the linear number
density of particles. We know that the wave function is not normalizable, and this
corresponds to the fact that the steady beams extend indefinitely to the left and
right of the step and therefore contain an infinite number of particles. This will
not concern us, however, because we only need to know the linear number density
of particles, and this is given by the square of the modulus of the wave function.


Looking at Equation 7.14, and recalling that the first term $A\,e^{i(k_1 x - \omega t)}$ represents a
wave travelling in the positive $x$-direction for $x < 0$, we identify this term as
representing the incident wave in Region 1 ($x < 0$). We can say that each particle
in the beam travels to the right with speed $v_{\text{inc}} = \hbar k_1/m$, and that the linear
number density of particles in the beam is

$$ n_{\text{inc}} = \left|A\,e^{i(k_1 x - \omega t)}\right|^2 = |A|^2. $$

(You will find further justification of this interpretation in the next subsection.)

Similarly, the second term on the right of Equation 7.14 can be interpreted as
representing the reflected beam in Region 1 ($x < 0$). This beam travels to the left
with speed $v_{\text{ref}} = \hbar k_1/m$ and has linear number density $n_{\text{ref}} = |B|^2$.


The first term on the right of Equation 7.15 represents the transmitted beam in
Region 2 ($x > 0$). This beam travels to the right with speed $v_{\text{trans}} = \hbar k_2/m$
and has linear number density $n_{\text{trans}} = |C|^2$. The second term on the right of
Equation 7.15 would represent a leftward moving beam in the region $x > 0$. On
physical grounds, we do not expect there to be any such beam, so we ensure its
absence by setting $D = 0$ in our equations.


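Using these interpretations, we see that the beam intensities are:

$$ j_{\text{inc}} = \frac{\hbar k_1}{m}\,|A|^2, \quad j_{\text{ref}} = \frac{\hbar k_1}{m}\,|B|^2 \quad \text{and} \quad j_{\text{trans}} = \frac{\hbar k_2}{m}\,|C|^2. \tag{7.16} $$

Expressions for the reflection and transmission coefficients then follow from
Equation 7.5:

$$ R = \frac{j_{\text{ref}}}{j_{\text{inc}}} = \frac{|B|^2}{|A|^2} = \left|\frac{B}{A}\right|^2, \tag{7.17} $$

$$ T = \frac{j_{\text{trans}}}{j_{\text{inc}}} = \frac{k_2\,|C|^2}{k_1\,|A|^2} = \frac{k_2}{k_1}\left|\frac{C}{A}\right|^2. \tag{7.18} $$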


It is worth noting that the expression for the transmission coefficient includes the
wave numbers $k_1$ and $k_2$, which are proportional to the speeds of the beams in
Regions 1 and 2. The wave numbers cancel in the expression for the reflection
coefficient because the incident and reflected beams both travel in the same region.




To calculate $R$ and $T$, we need to find the ratios $B/A$ and $C/A$. To achieve
this, we must eliminate unwanted arbitrary constants from our solutions to the
time-independent Schrödinger equation. This can be done by requiring that the
solutions satisfy the continuity boundary conditions introduced in Chapter 3:

• $\psi(x)$ is continuous everywhere.

• $d\psi(x)/dx$ is continuous where the potential energy function is finite.


The first of these conditions tells us that our two expressions for $\psi(x)$ must match
at their common boundary $x = 0$. From Equations 7.11 and 7.12, we therefore
obtain

$$ A + B = C. \tag{7.19} $$

Taking the derivatives of Equations 7.11 and 7.12,

$$ \frac{d\psi}{dx} = ik_1 A\,e^{ik_1 x} - ik_1 B\,e^{-ik_1 x} \quad \text{for } x < 0, $$

$$ \frac{d\psi}{dx} = ik_2 C\,e^{ik_2 x} \quad \text{for } x > 0, $$

so requiring the continuity of $d\psi/dx$ at $x = 0$ implies that

$$ ik_1 A - ik_1 B = ik_2 C. \tag{7.20} $$

After some manipulation, Equations 7.19 and 7.20 allow us to express $B$ and $C$ in
terms of $A$:

$$ B = \frac{k_1 - k_2}{k_1 + k_2}\,A \quad \text{and} \quad C = \frac{2k_1}{k_1 + k_2}\,A. $$

Combining these expressions with Equations 7.17 and 7.18, we finally obtain

$$ R = \frac{(k_1 - k_2)^2}{(k_1 + k_2)^2}, \tag{7.21} $$

$$ T = \frac{4k_1 k_2}{(k_1 + k_2)^2}. \tag{7.22} $$
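
The manipulation that leads from the two boundary conditions to the amplitude ratios is short enough to write out. The following is one possible route, given as a sketch; it uses only Equations 7.19 and 7.20 and introduces no new symbols.

```latex
\begin{align}
  A - B &= \frac{k_2}{k_1}\,C
    && \text{(Equation 7.20 divided by } ik_1\text{)} \\
  2A &= \left(1 + \frac{k_2}{k_1}\right) C
    && \text{(adding Equation 7.19)} \\
  C &= \frac{2k_1}{k_1 + k_2}\,A \\
  B &= C - A = \frac{k_1 - k_2}{k_1 + k_2}\,A
    && \text{(from Equation 7.19)}
\end{align}
```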


Since $k_1 = \sqrt{2mE_0}/\hbar$ and $k_2 = \sqrt{2m(E_0 - V_0)}/\hbar$, where $E_0$ is the incident
particle energy and $V_0$ is the height of the step, we have now managed to express
$R$ and $T$ entirely in terms of given quantities. The transmission coefficient, $T$, is
plotted against $E_0/V_0$ in Figure 7.9.


Figure 7.9 A graph of the transmission coefficient $T$ against $E_0/V_0$ for a finite
square step of height $V_0$.

The above results have been derived by considering a rightward moving beam
incident on an upward step of the kind shown in Figure 7.8. However, almost
identical calculations can be carried out for leftward moving beams or downward
steps. Equations 7.21 and 7.22 continue to apply in all these cases, provided we
take $k_1$ to be the wave number of the incident beam and $k_2$ to be the wave number
of the transmitted beam.
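
To get a feel for the numbers, the reflection and transmission coefficients can be evaluated directly. The short Python sketch below is illustrative only: the function name `step_coefficients` is an assumption made here, and it covers just the upward step with $E_0 > V_0$, using the wave-number ratio $k_2/k_1 = \sqrt{1 - V_0/E_0}$ implied by Equation 7.13.

```python
import math


def step_coefficients(energy_ratio: float) -> tuple[float, float]:
    """Return (R, T) for an upward step, given energy_ratio = E0/V0 > 1."""
    if energy_ratio <= 1.0:
        raise ValueError("this sketch only covers E0 > V0")
    # From Equation 7.13: k2/k1 = sqrt((E0 - V0)/E0) = sqrt(1 - V0/E0)
    wave_number_ratio = math.sqrt(1.0 - 1.0 / energy_ratio)
    # Equations 7.21 and 7.22, divided top and bottom by k1**2
    reflection = ((1.0 - wave_number_ratio) / (1.0 + wave_number_ratio)) ** 2
    transmission = 4.0 * wave_number_ratio / (1.0 + wave_number_ratio) ** 2
    return reflection, transmission


if __name__ == "__main__":
    for ratio in (1.01, 4.0 / 3.0, 2.0, 10.0):
        r, t = step_coefficients(ratio)
        print(f"E0/V0 = {ratio:6.3f}   R = {r:.4f}   T = {t:.4f}")
```

Because only the ratio $k_2/k_1$ enters, the particle mass drops out of the calculation, which is why a single argument is sufficient.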


The formulae for $R$ and $T$ are symmetrical with respect to an interchange of $k_1$
and $k_2$, so a beam of given kinetic energy, incident on a step of given magnitude, is
reflected to the same extent no matter whether the step is upwards or downwards. 
This may seem strange, but you should note that the reflection is a purely quantum 
effect, and has nothing to do with any classical forces provided by the step. 


Another surprising feature of Equation 7.21 is that R is independent of m and so 
does not vanish as the particle mass m becomes very large. However, we know 
from experience that macroscopic objects are not reflected by small changes in 
their potential energy function — you can climb upstairs without serious risk of 
being reflected! How can such everyday experiences be reconciled with wave 
mechanics? 


This puzzle can be resolved by noting that our calculation assumes an abrupt step. 
Detailed quantum-mechanical calculations show that Equation 7.21 provides a 
good approximation to reflections from a diffuse step provided that the 
wavelength of the incident particles is much longer than the distance over which 
the potential energy function varies. For example, Equation 7.21 accurately 
describes the reflection of an electron with a wavelength of 1 nm from a finite step 
that varies over a distance of order 0.1 nm. However, macroscopic particles have 
wavelengths that are much shorter than the width of any realistic step, so the above 
calculation does not apply to them. Instead, these particles obey the classical limit 
(as outlined in Chapter 6) so they are not reflected to any noticeable extent. 
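
The contrast between the two regimes can be made concrete by estimating de Broglie wavelengths. The sketch below is an illustration under stated assumptions: it uses the non-relativistic relation $\lambda = h/\sqrt{2mE_{\text{kin}}}$, and the two test cases (an electron with 1.5 eV of kinetic energy, and a 70 kg person walking at 1 m/s) are hypothetical examples chosen here, not values taken from the text.

```python
import math

PLANCK_CONSTANT = 6.62607015e-34   # J s
ELECTRON_MASS = 9.1093837e-31      # kg
ELECTRON_VOLT = 1.602176634e-19    # J


def de_broglie_wavelength(mass_kg: float, kinetic_energy_j: float) -> float:
    """Non-relativistic de Broglie wavelength, lambda = h / sqrt(2 m E_kin)."""
    return PLANCK_CONSTANT / math.sqrt(2.0 * mass_kg * kinetic_energy_j)


if __name__ == "__main__":
    # Hypothetical electron with 1.5 eV of kinetic energy
    electron = de_broglie_wavelength(ELECTRON_MASS, 1.5 * ELECTRON_VOLT)
    # Hypothetical 70 kg person moving at 1 m/s, so E_kin = m v^2 / 2 = 35 J
    person = de_broglie_wavelength(70.0, 0.5 * 70.0 * 1.0 ** 2)
    print(f"electron: {electron:.3e} m")
    print(f"person:   {person:.3e} m")
```

The electron's wavelength comes out at about a nanometre, longer than an atomic-scale step, whereas the macroscopic wavelength is dozens of orders of magnitude shorter than any realistic step width.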


Although we have been discussing the behaviour of beams of particles in this 
section, it is important to realize that these beams are really no more than a 
convenient fiction. The beams were simply used to provide a physical 
interpretation of de Broglie waves that could not be normalized. The crucial point 
is that we have arrived at explicit expressions for $R$ and $T$, and we have done so
using relatively simple stationary-state methods based on the time-independent
Schrödinger equation rather than computationally complicated wave packets.
Moreover, as you will see, the method we have used in this section can be 
generalized to other one-dimensional scattering problems. 


Exercise 7.3 (a) Use Equations 7.21 and 7.22 to show that $R + T = 1$.
(b) Evaluate $R$ and $T$ in the case that $E_0 = 2V_0$, and confirm that their sum is
equal to 1 in this case.


Exercise 7.4 Consider the case where $k_2 = k_1/2$.
(a) Express $B$ and $C$ in terms of $A$.


(b) Show that in the region $x > 0$, we have $|\Psi|^2 = 16|A|^2/9 = \text{constant}$, while in
the region $x < 0$, we have $|\Psi|^2 = |A|^2\,(10/9 + (2/3)\cos(2k_1 x))$.


(c) What general physical phenomenon is responsible for the spatial variation of
$|\Psi|^2$ to the left of the step?

(d) If the linear number density in the incident beam is $1.00 \times 10^{24}\ \text{m}^{-1}$, what are
the linear number densities in the reflected and transmitted beams?


Exercise 7.5 Based on the solution to Exercise 7.4, sketch a graph of $|\Psi|^2$ that
indicates its behaviour both to the left and to the right of a finite square step.


7.2.3 Probability currents 


The expressions we have derived for reflection and transmission coefficients were 
based on the assumption that the intensity of a beam is the product of the speed of 
its particles and their linear number density. This assumption seems very natural 
from the viewpoint of classical physics, but we should always be wary about 
carrying over classical ideas into quantum physics. In this section we shall 
establish a general quantum-mechanical formula for the beam intensity. The 
formula will be consistent with the assumptions made so far, but is also more 
general, applying in regions where a classical beam would not exist and for 
localized wave packets as well as steady beams. 


At the heart of our analysis lies the idea that matter is conserved. Neglecting 
relativistic processes in which particles can be created or destroyed, the total 
number of particles remains fixed. This is built deep into the formalism of 
quantum mechanics: if the wave function describing a particle is normalized now, 
it will remain normalized forever because particles do not simply disappear. The 
conservation of particles applies locally as well as globally, so if the number of 
particles in a small region changes, this must be due to particles entering or 
leaving the region by crossing its boundaries. We shall now express this idea in 
mathematical terms. 


Let us first consider the one-dimensional flow of a fluid along the $x$-axis. At each
point, we define a fluid current $j_x(x,t)$ that represents the rate of flow of fluid
particles along the $x$-axis. If the fluid is compressible, like air, this fluid current
may vary in space and time.


Figure 7.10 shows a small one-dimensional region between $x$ and $x + \delta x$. The
number of particles in this region can be written as $n(x,t)\,\delta x$, where $n(x,t)$ is the
linear number density of particles. The change in the number of particles in the
region during a small time interval $\delta t$ is then

$$ \delta n(x,t)\,\delta x = j_x(x,t)\,\delta t - j_x(x + \delta x, t)\,\delta t, \tag{7.23} $$

where, for flow in the positive $x$-direction, the first term on the right-hand side
represents the number of particles entering the region from the left and the
second term represents the number of particles leaving the region to the right.
Rearranging Equation 7.23 gives

$$ \frac{\delta n(x,t)}{\delta t} = -\,\frac{j_x(x + \delta x, t) - j_x(x,t)}{\delta x}, $$

and, on taking the limit as $\delta x$ and $\delta t$ tend to zero we see that

$$ \frac{\partial n(x,t)}{\partial t} = -\,\frac{\partial j_x(x,t)}{\partial x}. \tag{7.24} $$

Figure 7.10 Any change in the number of particles in a small region is due to fluid
currents that carry particles into or out of the region.


This result is called the equation of continuity in one dimension. With a little
care, it can be extended to quantum mechanics.

For a single-particle wave packet in quantum mechanics, the flowing quantity is
probability density. This is evident from images of wave packets as ‘blobs’ of
moving probability density (e.g. Figure 7.6). Now we know that probability
density is represented in quantum mechanics by $\Psi^*\Psi$, so we should be able to
construct the appropriate equation of continuity by examining the time derivative
of this quantity.


Obviously, we have

$$ \frac{\partial(\Psi^*\Psi)}{\partial t} = \Psi^*\,\frac{\partial \Psi}{\partial t} + \Psi\,\frac{\partial \Psi^*}{\partial t}, \tag{7.25} $$

where Schrödinger’s equation dictates that

$$ i\hbar\,\frac{\partial \Psi}{\partial t} = -\frac{\hbar^2}{2m}\,\frac{\partial^2 \Psi}{\partial x^2} + V(x)\,\Psi(x,t). $$

Dividing through by $i\hbar$, the rate of change of the wave function is

$$ \frac{\partial \Psi}{\partial t} = \frac{i\hbar}{2m}\,\frac{\partial^2 \Psi}{\partial x^2} - \frac{i}{\hbar}\,V(x)\,\Psi(x,t). \tag{7.26} $$

Substituting this equation, and its complex conjugate, into Equation 7.25, we then
obtain

$$ \frac{\partial(\Psi^*\Psi)}{\partial t} = \frac{i\hbar}{2m}\left(\Psi^*\,\frac{\partial^2 \Psi}{\partial x^2} - \Psi\,\frac{\partial^2 \Psi^*}{\partial x^2}\right), $$

(the potential energy function $V(x)$ is real and cancels out) and a further
manipulation gives

$$ \frac{\partial(\Psi^*\Psi)}{\partial t} = \frac{\partial}{\partial x}\left[\frac{i\hbar}{2m}\left(\Psi^*\,\frac{\partial \Psi}{\partial x} - \Psi\,\frac{\partial \Psi^*}{\partial x}\right)\right]. $$


This equation can be written in the form of an equation of continuity:

$$ \frac{\partial n(x,t)}{\partial t} = -\,\frac{\partial j_x(x,t)}{\partial x}, \tag{7.27} $$

provided that we interpret $n(x,t)$ as the probability density $\Psi^*\Psi$, with a
corresponding current

$$ j_x(x,t) = -\frac{i\hbar}{2m}\left(\Psi^*\,\frac{\partial \Psi}{\partial x} - \Psi\,\frac{\partial \Psi^*}{\partial x}\right). \tag{7.28} $$

In the one-dimensional situations we are considering, $j_x(x,t)$ is called the
probability current. In one dimension, the probability density $n = \Psi^*\Psi$ is a
probability per unit length, and therefore has the dimensions of $[\mathrm{L}]^{-1}$. It follows
from Equation 7.27 that the probability current has dimensions of

$$ [j_x] = [\mathrm{L}] \times [\mathrm{T}]^{-1} \times [\mathrm{L}]^{-1} = [\mathrm{T}]^{-1}, $$

and therefore has the SI unit ‘per second’, as expected for a current of particles.


These ideas can be readily extended to steady beams of particles. Snapshots of a
steady beam would not reveal any changes from one moment to the next but the
beam nevertheless carries a steady flow of particles, just as a steadily flowing river
carries a current of water. For a particle beam, $\Psi^*\Psi$ represents the linear number
density of particles and the probability current is the rate of flow of particles in the
positive $x$-direction. For a steady beam, described by a stationary-state wave
function, $\Psi(x,t) = \psi(x)\,e^{-iEt/\hbar}$, the time-dependent phase factors cancel out in
Equation 7.28, and the probability current can be written more simply as

$$ j_x(x) = -\frac{i\hbar}{2m}\left(\psi^*\,\frac{d\psi}{dx} - \psi\,\frac{d\psi^*}{dx}\right), \tag{7.29} $$

which is independent of time. It is important to realize that $j_x$ is a signed quantity;
it is positive for a beam travelling in the positive $x$-direction, and negative for a
beam travelling in the negative $x$-direction. This is unlike the beam intensity $j$
introduced earlier, which is always positive. It is natural to define the beam
intensity of a steady beam to be the magnitude of the probability current: $j = |j_x|$.
It is this definition that gives us a way of calculating beam intensities without
making unwarranted classical assumptions.


In fact, each beam intensity calculated using Equation 7.29 turns out to be 
precisely what we have always assumed — the product of a particle speed and a 
linear number density — as you can check by tackling Exercise 7.7 below. So our 
analysis adds rigour, but contains no surprises. However, the really significant 
feature of Equation 7.28 is its generality; it applies to single-particle wave packets 
as well as to steady beams, and (as you will see later) it will also apply in cases of 
tunnelling, where a classical beam does not exist. 
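
As a numerical illustration of Equation 7.29, the sketch below evaluates the probability current on either side of a finite square step, using the amplitudes $B$ and $C$ found earlier. Several things here are assumptions made purely for illustration: the natural units $\hbar = m = 1$, the sample wave numbers $k_1 = 2$ and $k_2 = 1$, the function names, and the use of a central-difference approximation for $d\psi/dx$.

```python
import cmath

HBAR = 1.0   # natural units, assumed here for illustration
MASS = 1.0


def step_wave_function(x, k1, k2, a=1.0):
    """Stationary-state psi(x) for a step at x = 0 (Equations 7.11 and 7.12, D = 0)."""
    b = a * (k1 - k2) / (k1 + k2)
    c = a * 2.0 * k1 / (k1 + k2)
    if x < 0:
        return a * cmath.exp(1j * k1 * x) + b * cmath.exp(-1j * k1 * x)
    return c * cmath.exp(1j * k2 * x)


def probability_current(psi, x, h=1e-5):
    """Equation 7.29, with dpsi/dx estimated by a central difference of width 2h."""
    dpsi = (psi(x + h) - psi(x - h)) / (2.0 * h)
    value = psi(x)
    current = -1j * HBAR / (2.0 * MASS) * (
        value.conjugate() * dpsi - value * dpsi.conjugate()
    )
    return current.real  # the imaginary part vanishes up to rounding error


if __name__ == "__main__":
    k1, k2 = 2.0, 1.0

    def psi(x):
        return step_wave_function(x, k1, k2)

    # Sample points are kept away from x = 0 so the difference never straddles the step.
    print("current at x = -1:", probability_current(psi, -1.0))
    print("current at x = +1:", probability_current(psi, 1.0))
```

In this sketch the two values agree, reflecting the fact that no particles accumulate at the step: the net current to the left of the step (incident minus reflected) equals the transmitted current to the right.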


Exercise 7.6 Show that $j_x(x,t)$ as defined in Equation 7.28 is a real quantity.


Exercise 7.7 Using the solutions to the Schrödinger equation that were
obtained in the stationary-state approach to scattering from a finite square step,
evaluate the probability current in the regions $x > 0$ and $x < 0$. Interpret your
results in terms of the beam intensities in these two regions.


7.2.4 Scattering from finite square wells and barriers 


The procedure used to analyze scattering from a finite square step can also
be applied to scattering from finite square wells or barriers, or indeed to any
combination of finite square steps, wells and barriers. The general procedure is as
follows:


• Divide the $x$-axis into the minimum possible number of regions of constant
potential energy.


• Write down the general solution of the relevant time-independent Schrödinger
equation in each of these regions, remembering to use the appropriate value of
the wave number $k$ in each region and introducing arbitrary constants as
necessary.


• Use continuity boundary conditions to determine all but one of the arbitrary
constants. The one remaining constant is associated with the incident beam,
which may enter from the left or the right.


• Obtain expressions for all the beam intensities relative to the intensity of the
incident beam.


• Determine the reflection and transmission coefficients from ratios of beam
intensities.
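
The steps above can be sketched in code. The Python function below is a minimal illustration, not a definitive implementation: the name `scattering_coefficients` and its arguments are assumptions made here, the beam is taken to be incident from the left, and the energy is assumed to lie above the potential energy in every region so that all wave numbers are real. It fixes the transmitted amplitude at 1 in the rightmost region and then applies the two continuity conditions at each boundary in turn, working back towards the incident beam.

```python
import cmath
import math


def scattering_coefficients(wave_numbers, boundaries):
    """Return (R, T) for a beam incident from the left.

    wave_numbers: k in each region, left to right (all real and positive).
    boundaries:   x-positions separating consecutive regions, left to right.
    """
    if len(wave_numbers) != len(boundaries) + 1:
        raise ValueError("need exactly one more region than boundaries")
    # Rightmost region: transmitted beam only, with unit amplitude.
    rightward, leftward = 1.0 + 0.0j, 0.0 + 0.0j
    # Step back through the boundaries, matching psi and dpsi/dx at each one.
    for j in range(len(boundaries) - 1, -1, -1):
        x = boundaries[j]
        k_left, k_right = wave_numbers[j], wave_numbers[j + 1]
        forward = rightward * cmath.exp(1j * k_right * x)
        backward = leftward * cmath.exp(-1j * k_right * x)
        psi = forward + backward                            # psi at the boundary
        slope = (k_right / k_left) * (forward - backward)   # (dpsi/dx) / (i k_left)
        rightward = 0.5 * (psi + slope) * cmath.exp(-1j * k_left * x)
        leftward = 0.5 * (psi - slope) * cmath.exp(1j * k_left * x)
    incident = abs(rightward) ** 2
    reflection = abs(leftward) ** 2 / incident
    # Intensity ratio: speeds are proportional to wave numbers.
    transmission = (wave_numbers[-1] / wave_numbers[0]) / incident
    return reflection, transmission


if __name__ == "__main__":
    # A single step with k2 = k1/2, then a well with equal outer wave numbers.
    print(scattering_coefficients([2.0, 1.0], [0.0]))
    print(scattering_coefficients([1.0, 2.0, 1.0], [0.0, 1.0]))
```

Working backwards from the transmitted beam avoids solving simultaneous equations: each boundary determines the two amplitudes on its left from the two on its right.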
The best way to become familiar with this procedure is by means of examples and
exercises. Here is a worked example involving a finite square well of the kind
shown in Figure 7.11.


Essential skill: Solving scattering problems in one dimension

Worked Example 7.1
A particle of mass $m$ with positive energy $E_0$ is scattered by a
one-dimensional finite square well of depth $V_0$ and width $L$ (shown in
Figure 7.11). Derive an expression for the probability that the particle will
be transmitted across the well.


Figure 7.11 A finite square well of depth $V_0$ and width $L$. A particle of
energy $E_0$ is scattered by the well.


Solution 

Suppose the particle concerned to be part of an intense steady beam of
identical particles, each having energy $E_0$ and each incident from the left on
a well located between $x = 0$ and $x = L$.

Divide the $x$-axis into three regions: Region 1 ($x < 0$), where $V(x) = 0$;
Region 2 ($0 \le x \le L$), where $V(x) = -V_0$; and Region 3 ($x > L$), where
$V(x) = 0$. In each region the time-independent Schrödinger equation takes
the general form

$$ -\frac{\hbar^2}{2m}\,\frac{d^2\psi}{dx^2} + V(x)\,\psi(x) = E_0\,\psi(x), \tag{7.30} $$

so the solution in each region is:

$$ \psi(x) = A\,e^{ik_1 x} + B\,e^{-ik_1 x} \quad \text{for } x < 0, \tag{7.31} $$

$$ \psi(x) = C\,e^{ik_2 x} + D\,e^{-ik_2 x} \quad \text{for } 0 \le x \le L, \tag{7.32} $$

$$ \psi(x) = F\,e^{ik_1 x} + G\,e^{-ik_1 x} \quad \text{for } x > L, \tag{7.33} $$

where $A$, $B$, $C$, $D$, $F$ and $G$ are arbitrary constants, and the wave numbers
in Region 1 and Region 2 are

$$ k_1 = \frac{\sqrt{2mE_0}}{\hbar} \quad \text{and} \quad k_2 = \frac{\sqrt{2m(E_0 + V_0)}}{\hbar}. \tag{7.34} $$

(To avoid confusion with energy, we don’t use the symbol $E$ for an arbitrary
constant in Equation 7.33.)
Note that the wave number in Region 3 is the same as that in Region 1. This
is because the potential energy function is the same in these two regions.


There is no leftward moving beam in Region 3, so we set $G = 0$. Since the
potential energy function is finite everywhere, $\psi(x)$ and $d\psi/dx$ must be
continuous everywhere. Continuity of $\psi(x)$ and $d\psi/dx$ at $x = 0$ implies that

$$ A + B = C + D, \tag{7.35} $$

$$ ik_1 A - ik_1 B = ik_2 C - ik_2 D, \tag{7.36} $$

while continuity of $\psi(x)$ and $d\psi/dx$ at $x = L$ gives

$$ C\,e^{ik_2 L} + D\,e^{-ik_2 L} = F\,e^{ik_1 L}, \tag{7.37} $$

$$ ik_2 C\,e^{ik_2 L} - ik_2 D\,e^{-ik_2 L} = ik_1 F\,e^{ik_1 L}. \tag{7.38} $$

Since the wave numbers in Regions 1 and 3 are both equal to $k_1$, the
intensities of the incident and transmitted beams are

$$ j_{\text{inc}} = \frac{\hbar k_1}{m}\,|A|^2 \quad \text{and} \quad j_{\text{trans}} = \frac{\hbar k_1}{m}\,|F|^2, \tag{7.39} $$

and the required transmission coefficient is given by $T = |F|^2/|A|^2$.

The mathematical task is now to eliminate $B$, $C$ and $D$ from Equations 7.35
to 7.38 in order to find the ratio $F/A$. To achieve this, we note that the
constant $B$ only appears in the first two equations, so we take the
opportunity of eliminating it immediately. Multiplying Equation 7.35 by $ik_1$
and adding the result to Equation 7.36 we obtain

$$ 2ik_1 A = i(k_2 + k_1)\,C - i(k_2 - k_1)\,D. \tag{7.40} $$

Now we must eliminate $C$ and $D$ from the remaining equations. To
eliminate $D$, we multiply Equation 7.37 by $ik_2$ and add the result to
Equation 7.38. This gives

$$ 2ik_2\,e^{ik_2 L}\,C = i(k_2 + k_1)\,F\,e^{ik_1 L}, $$

$$ \text{so} \quad C = \frac{k_2 + k_1}{2k_2}\,F\,e^{ik_1 L}\,e^{-ik_2 L}. \tag{7.41} $$

Similarly, multiplying Equation 7.37 by $ik_2$ and subtracting the result from
Equation 7.38 we see that

$$ 2ik_2\,e^{-ik_2 L}\,D = i(k_2 - k_1)\,F\,e^{ik_1 L}, $$

$$ \text{so} \quad D = \frac{k_2 - k_1}{2k_2}\,F\,e^{ik_1 L}\,e^{ik_2 L}. \tag{7.42} $$

Finally, substituting Equations 7.41 and 7.42 into Equation 7.40 and
rearranging slightly we obtain

$$ \frac{F}{A} = \frac{4k_1 k_2\,e^{-ik_1 L}}{(k_2 + k_1)^2\,e^{-ik_2 L} - (k_2 - k_1)^2\,e^{ik_2 L}}, $$

so the transmission coefficient is given by

$$ T = \left|\frac{F}{A}\right|^2 = \frac{16k_1^2 k_2^2}{\left|(k_2 + k_1)^2\,e^{-ik_2 L} - (k_2 - k_1)^2\,e^{ik_2 L}\right|^2}. \tag{7.43} $$

This is the expression we have been seeking; it only involves $k_1$, $k_2$ and $L$,
and both $k_1$ and $k_2$ can be written in terms of $m$, $E_0$ and $V_0$.

Although Equation 7.43 provides a complete answer to the worked example, it is
expressed in a rather opaque form. After several pages of substitutions and
manipulations (which are not a good investment of your time) it is possible to
recast and simplify this formula. We just quote the final result:

$$ T = \frac{4E_0(E_0 + V_0)}{4E_0(E_0 + V_0) + V_0^2\,\sin^2(k_2 L)}. \tag{7.44} $$


Treating $E_0$ as an independent variable and $V_0$ as a given constant, this function
can be displayed as a graph of $T$ against $E_0/V_0$, as shown in Figure 7.12.


Figure 7.12 The transmission coefficient $T$ for a particle of energy $E_0$
scattering from a finite square well of depth $V_0$, plotted against $E_0/V_0$. To plot
this graph, we have taken the particle to have the mass of an electron and taken
the well to have a depth of 8.6 eV and a width of 1.0 nm.
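
Rather than working through the algebra, the equivalence of Equations 7.43 and 7.44 can be checked numerically. The following Python sketch does this for the parameters quoted in the caption of Figure 7.12 (an electron, a well of depth 8.6 eV and width 1.0 nm). The function names and the sample energies are assumptions chosen here for illustration, and agreement at a few energies is a consistency check rather than a proof.

```python
import cmath
import math

HBAR = 1.054571817e-34             # J s
ELECTRON_MASS = 9.1093837e-31      # kg
ELECTRON_VOLT = 1.602176634e-19    # J
WELL_DEPTH = 8.6 * ELECTRON_VOLT   # V0, from the Figure 7.12 caption
WELL_WIDTH = 1.0e-9                # L, from the Figure 7.12 caption


def wave_numbers(e0):
    """Return (k1, k2) from Equation 7.34 for incident energy e0 in joules."""
    k1 = math.sqrt(2.0 * ELECTRON_MASS * e0) / HBAR
    k2 = math.sqrt(2.0 * ELECTRON_MASS * (e0 + WELL_DEPTH)) / HBAR
    return k1, k2


def transmission_eq_7_43(e0):
    """Transmission coefficient from the complex-exponential form."""
    k1, k2 = wave_numbers(e0)
    phase = k2 * WELL_WIDTH
    denominator = ((k2 + k1) ** 2 * cmath.exp(-1j * phase)
                   - (k2 - k1) ** 2 * cmath.exp(1j * phase))
    return 16.0 * k1 ** 2 * k2 ** 2 / abs(denominator) ** 2


def transmission_eq_7_44(e0):
    """Transmission coefficient from the simplified form."""
    _, k2 = wave_numbers(e0)
    common = 4.0 * e0 * (e0 + WELL_DEPTH)
    return common / (common + WELL_DEPTH ** 2 * math.sin(k2 * WELL_WIDTH) ** 2)


if __name__ == "__main__":
    for ratio in (0.05, 0.1, 0.5, 1.0, 2.0):
        e0 = ratio * WELL_DEPTH
        print(f"E0/V0 = {ratio:4.2f}   "
              f"T(7.43) = {transmission_eq_7_43(e0):.6f}   "
              f"T(7.44) = {transmission_eq_7_44(e0):.6f}")
```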


As the incident energy is increased significantly above 0 (the top of the well),
transmission becomes relatively more likely, but there is generally some chance of
reflection ($R = 1 - T \neq 0$), and for some energies the reflection probability may
be quite high. However, Figure 7.12 also shows that there are some special
incident particle energies at which $T = 1$, so that transmission is certain to
occur. Although it is a rather poor use of terminology, these peaks of high
transmission are usually called transmission resonances. Equation 7.44 shows
that transmission resonances occur when $\sin(k_2 L) = 0$, that is when

$$ k_2 L = N\pi \quad \text{for } N = 1, 2, 3, \ldots \tag{7.45} $$


Recalling the relationship $k = 2\pi/\lambda$ between wave number and wavelength, we
can also express this condition as $N\lambda_2 = 2L$; in other words:


A transmission resonance occurs when a whole number of wavelengths
occupies the full path length $2L$ of a wave that crosses the width of the well
and is reflected back again.
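
Combining Equation 7.45 with the expression for $k_2$ in Equation 7.34 gives the resonance energies explicitly, $E_0 = N^2\pi^2\hbar^2/(2mL^2) - V_0$, with only those values of $N$ that give $E_0 > 0$ being relevant to scattering. The Python sketch below evaluates this for the parameters quoted for Figure 7.12; the function name and the decision to list the first three resonances are assumptions made here for illustration.

```python
import math

HBAR = 1.054571817e-34             # J s
ELECTRON_MASS = 9.1093837e-31      # kg
ELECTRON_VOLT = 1.602176634e-19    # J


def resonance_energies(mass, depth, width, count):
    """Return the first `count` pairs (N, E0) satisfying k2 * L = N * pi with E0 > 0."""
    found = []
    n = 1
    while len(found) < count:
        # k2 = N pi / L, and E0 + V0 = (hbar k2)^2 / (2 m) from Equation 7.34
        e0 = (n * math.pi * HBAR / width) ** 2 / (2.0 * mass) - depth
        if e0 > 0.0:
            found.append((n, e0))
        n += 1
    return found


if __name__ == "__main__":
    depth = 8.6 * ELECTRON_VOLT   # well depth quoted for Figure 7.12
    width = 1.0e-9                # well width quoted for Figure 7.12
    for n, e0 in resonance_energies(ELECTRON_MASS, depth, width, 3):
        print(f"N = {n}:  E0 = {e0 / ELECTRON_VOLT:.2f} eV,  E0/V0 = {e0 / depth:.3f}")
```

For these parameters the smaller values of $N$ correspond to negative $E_0$ and so do not appear as scattering resonances.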


This condition can be interpreted in terms of interference between waves reflected
at $x = 0$ and $x = L$. The interference turns out to be destructive because
reflection at the $x = 0$ interface is accompanied by a phase change of $\pi$. Of
course, suppression of the reflected beam is accompanied by an enhancement
of the transmitted beam. The effect is similar to one found in optics, where
destructive interference between waves reflected from the front and back surfaces
of a thin transparent film accounts for the success of the anti-reflective coatings on
lenses and mirrors.




Before leaving the subject of scattering from a finite well
there is one other point that deserves attention. This concerns
the precise form of the functions $\psi(x)$ in the three regions
we identified earlier. The value of $|A|^2$ is equal to the linear
number density of particles in the incident beam. If we
regard $A$ as being known, the values of the constants $B$,
$C$, $D$ and $F$ can be evaluated using the continuity boundary
conditions and graphs of $\psi(x)$ can be drawn.

Figure 7.13 A typical example of the real part $\mathrm{Re}(\psi)$ of the function $\psi(x)$
for the kind of square well shown in Figure 7.11.

Since $\psi(x)$ is generally complex there are several possible
graphs that might be of interest, including the real part of
$\psi$, the imaginary part of $\psi$, and $|\psi|^2$. Figure 7.13 shows
a typical plot of the real part of $\psi$ for chosen values of $m$,
$E_0$, $V_0$ and $L$; the following points should be noted:


1. In each region, $\psi(x)$ is a periodic function of $x$.


2. The wavelength is smaller inside the well than outside. The shorter 
wavelength corresponds to a higher wave number, and higher momentum, 
and echoes the classical increase in speed that a particle experiences as it 
enters the well. 


3. The amplitude of the wave is smaller inside the well than outside. This is 
because the beam moves more rapidly inside the well, and has a lower linear 
number density there. 


Exercise 7.8 With one modification, the stationary-state method that was
applied to scattering from a finite square well can also be applied to scattering
from a finite square barrier of height $V_0$ and width $L$, when $E_0 > V_0$. Describe the
modification required, draw a figure analogous to Figure 7.13, but for the case of a
square barrier, and comment on any differences between the two graphs.


Exercise 7.9 Draw two sketches to illustrate the form of the function
$|\Psi(x,t)|^2$ in the case of the stationary-state approach to scattering from (a) a finite
square well and (b) a finite square barrier. Comment on the time-dependence of
$|\Psi(x,t)|^2$ in each case.


7.2.5 Scattering in three dimensions 


Sophisticated methods have been developed to analyze scattering in
three dimensions. The complexity of these methods makes them unsuitable for
inclusion in this book but it is appropriate to say something about the basic 
concepts involved. 


In three dimensions, we are obliged to think in terms of scattering at a given angle, 
rather than in terms of one-dimensional reflection or transmission. We distinguish 
between the incident particles (some of which may be unaffected by the target) 
and the scattered particles which are affected by the target in some way (changing 
their direction of motion, energy or state of internal excitation). The detectors for 
the scattered particles are placed far from the target, well outside the range of 
interaction of the incident beam and the target, so the scattering process is 
complete by the time the particles are detected. The incident beam is assumed to
be uniform and broad enough to cover all the regions in which the beam particles
interact with the target. The incident beam is characterized by its flux; this is the
number of particles passing per unit time per unit area perpendicular to the beam.


If we consider a particular scattering experiment (electron-proton scattering, for
example), one of the main quantities of interest is the total cross-section, $\sigma$. This
is the total rate at which scattered particles emerge from the target, per unit
incident flux. The total cross-section has the dimensions of area. Very
loosely, you can think of it as representing the ‘effective’ area that the target
presents to an incident projectile, but you should not give too much weight to this
classical interpretation, as most total cross-sections vary markedly with the energy
of the incident particles. An acceptable SI unit for the measurement of total
cross-sections would be $\text{m}^2$, but measured cross-sections are generally so small
that physicists prefer to use a much smaller unit called the barn, defined by the
relation $1\ \text{barn} = 1 \times 10^{-28}\ \text{m}^2$. The name is intended as a joke, 1 barn being
such a large cross-section in particle and nuclear physics that it can be considered
to be ‘as big as a barn door’. Many cross-sections are measured in millibarn (mb),
microbarn (μb) or even nanobarn (nb).


Scattering processes that conserve the total kinetic energy of the colliding 
particles are said to be examples of elastic scattering. They may be contrasted 
with cases of inelastic scattering where the particles may change their internal 
state of excitation or be absorbed; particles may even be created or destroyed, 
especially at very high energies. In reality, total cross-sections often contain both 
elastic and inelastic contributions. 


Scattering experiments are often analyzed in great detail. The total cross-section 
arises as a sum of contributions from particles scattered in different directions. 
For each direction, we can define a quantity called the differential cross-section, 
which tells us the rate of scattering in a small cone of angles around the given 
direction. The integral of the differential cross-section, taken over all directions, is 
equal to the total cross-section. We can also vary the energy of the incident beam. 
Both the total cross-section and the differential cross-section depend on the 
energies of the incident particles. There is therefore a wealth of experimental 
information to collect, interpret and explain. 
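
The relationship between the two kinds of cross-section can be stated compactly. In the sketch below, the notation $d\sigma/d\Omega$ for the differential cross-section, the solid-angle element $d\Omega$ and the spherical polar angles $\theta$ and $\phi$ (measured relative to the incident beam direction) are standard conventions assumed here; they are not introduced in the text above.

```latex
\begin{equation}
  \sigma = \int \frac{d\sigma}{d\Omega}\, d\Omega
         = \int_0^{2\pi}\! \int_0^{\pi} \frac{d\sigma}{d\Omega}(\theta, \phi)\,
           \sin\theta \; d\theta \, d\phi .
\end{equation}
```

In the special case of scattering that is the same in all directions, the integrand is constant and the total cross-section is simply $4\pi$ times the differential cross-section.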


In exploring the microscopic world of atoms, nuclei and elementary particles, 
physicists have few options, other than to carry out a scattering experiment. This 
process has been compared with that of trying to find out how a finely-crafted 
watch works by the expedient of hurling stones at it and seeing what bits come 
flying out. It is not a delicate business, but by collecting all the data that a 
scattering experiment provides, and by comparing these data with the predictions 
of quantum physics, physicists have learnt an amazing amount about matter on the 
scale of atoms and below. One early discovery in the scattering of electrons 
from noble gas atoms (such as xenon) was a sharp dip in the measured 
cross-section at an energy of about 1 eV. The experimental discovery of this 
Ramsauer-Townsend effect in the early 1920s was an early indication from 
elastic scattering that some new theoretical insight was needed that would
take physics beyond the classical domain. The effect is now recognized as a
three-dimensional analogue of the transmission resonance we met earlier. 


At the much higher collision energies made available by modern particle
accelerators, such as those at the CERN laboratory in Geneva, total cross-sections
become dominated by inelastic effects, as new particles are produced. As an
example, Figure 7.14 shows some data concerning the scattering of $K^-$ mesons
by protons. The upper curve shows the variation of the total cross-section over a
very wide range of energies, up to several GeV ($1\ \text{GeV} = 10^9\ \text{eV}$). The lower
curve shows the contribution from elastic scattering alone. As the collision energy
increases, the contribution from elastic scattering becomes less and less important
as inelastic processes become more common.

Figure 7.14 The total cross-section (upper curve) and the elastic contribution
alone (lower curve), plotted against collision energy, for the scattering of $K^-$
mesons by protons.


7.3 Tunnelling: wave packets and stationary states


One of the most surprising aspects of quantum physics is the ability of particles to
pass through regions that they are classically forbidden from entering. This is the
phenomenon of quantum-mechanical tunnelling that was mentioned in the
Introduction.


In this section we first demonstrate the phenomenon of tunnelling with the aid
of wave packets. We then go on to examine some of its quantitative features
using stationary-state methods, similar to those used in our earlier discussion of
scattering from wells and barriers.


7.3.1 Wave packets and tunnelling in one dimension


Figure 7.15 shows a sequence of images captured from the wave packet
simulation program you used earlier. The sequence involves a Gaussian wave
packet, with energy expectation value $\langle E \rangle = E_0$, incident from the left on a finite
square barrier of height $V_0$. The sequence is broadly similar to that shown in
Figure 7.6, which involved a similar wave packet and a similar barrier, but with
one important difference; in the earlier process $E_0$ was greater than $V_0$, so
transmission was classically allowed, but in the case of Figure 7.15 $E_0$ is less than
$V_0$ and transmission is classically forbidden. The bottom image shows that
transmission can occur in quantum mechanics.

Figure 7.15 The passage of a wave packet with $\langle E \rangle = E_0$ through a finite square
barrier of height $V_0$ when $E_0 < V_0$. The probability density $|\Psi|^2$ is shown in a
sequence of snapshots with time increasing from top to bottom.


In the case shown in Figure 7.15, part of the reason for transmission is that the 
wave packet has a spread of energies, some of which lie above the top of the 
barrier. However, there is a second reason, which applies even for wave packets 
with energies wholly below the top of the barrier; there is the possibility that a 
particle, with insufficient energy to surmount the barrier, may nevertheless tunnel 
through it. For a given wave packet, the probability of tunnelling decreases with 
the height of the barrier and it also decreases very markedly with its thickness. We 
shall now use stationary-state methods to investigate this phenomenon. 


7.3.2 Stationary states and barrier penetration 


The example of tunnelling we have just been examining can be regarded as a 
special case of scattering; it just happens to have $E_0 < V_0$. As long as we keep 
this energy range in mind, we can apply the same stationary-state methods to the 
study of tunnelling that we used earlier when studying scattering. 


As before, we shall start by considering the finite square step, whose potential 
energy function was defined in Equations 7.6 and 7.7. This is shown for the case 
Eo < Vo in Figure 7.16. The potential energy function divides the x-axis into two 
regions: Region 1 (x < 0) which contains the incident and reflected beams, and 
Region 2 (x > 0) which contains what is effectively an infinitely wide barrier. 
There is no possibility of tunnelling through the barrier in this case since there is 
no classically allowed region on the far side, but the finite square step nonetheless 
constitutes a valuable limiting case, as you will see. 


Figure 7.16 A finite square step potential energy function. Also shown is the 
energy $E_0$ of an incident particle for the case $E_0 < V_0$, where $V_0$ is the height of 
the step. 


Proceeding as before, we seek stationary-state solutions of the Schrödinger 
equation of the form $\Psi(x, t) = \psi(x)\,e^{-iE_0 t/\hbar}$, where $\psi(x)$ is a solution of the 
corresponding time-independent Schrödinger equation. In this case we might try 
to use exactly the same solution as in Section 7.2.2 but doing so would complicate 
the analysis since we would find that in the region $x > 0$ the wave number 
$k_2 = \sqrt{2m(E_0 - V_0)}/\hbar$ would be imaginary. In view of this, it is better to use the 
approach taken in a similar situation in Chapter 3; recognize that $E_0 < V_0$ 
generally implies a combination of exponentially growing and exponentially 
decaying terms in the region $x > 0$, and write the solutions as 

\[ \psi(x) = A e^{ik_1 x} + B e^{-ik_1 x} \quad \text{for } x < 0, \tag{7.46} \]
\[ \psi(x) = C e^{-\alpha x} + D e^{\alpha x} \quad \text{for } x > 0, \tag{7.47} \]

where $A$, $B$, $C$ and $D$ are arbitrary complex constants, while $k_1$ and $\alpha$ are real 
quantities given by 

\[ k_1 = \frac{\sqrt{2mE_0}}{\hbar} \quad \text{and} \quad \alpha = \frac{\sqrt{2m(V_0 - E_0)}}{\hbar}. \tag{7.48} \]


We require $D$ to be zero on physical grounds, to avoid having any part of the 
solution that grows exponentially as $x$ approaches infinity. To determine the 
values of $B$ and $C$ relative to that of $A$ we impose the usual requirement (for a 
finite potential energy function) that both $\psi(x)$ and its derivative $d\psi/dx$ must be 
continuous everywhere. Applying these conditions at $x = 0$ we find: 

\[ A + B = C, \]
\[ ik_1 A - ik_1 B = -\alpha C, \]

from which it follows that 

\[ C = \frac{2ik_1}{ik_1 - \alpha}\,A \quad \text{and} \quad B = \frac{ik_1 + \alpha}{ik_1 - \alpha}\,A. \]

The reflection coefficient is given by 

\[ R = \frac{|B|^2}{|A|^2} = \left| \frac{ik_1 + \alpha}{ik_1 - \alpha} \right|^2 = \left| \frac{\alpha + ik_1}{\alpha - ik_1} \right|^2 = 1. \]

(For any complex number $z$, $|z/z^*| = 1$.) 

So, if particles of energy $E_0 < V_0$ encounter a finite square 
step of height $V_0$, reflection is certain. There is no transmission 
and no possibility of particles becoming lodged inside the 
step; everything must eventually be reflected. Note however 
that $\psi(x)$ is not zero inside the step (see Figure 7.17). Rather, 
it decreases exponentially over a length scale determined 
by the quantity $\alpha = \sqrt{2m(V_0 - E_0)}/\hbar$, which is usually 
called the attenuation coefficient. This is another example 
of the phenomenon of barrier penetration that you met 
in Chapter 3. It is not the same as tunnelling since there is 
no transmitted beam, but it is what makes tunnelling possible, 
and the occurrence of exponentially decaying solutions 
in a classically forbidden region suggests why tunnelling 
probabilities decline rapidly as barrier width increases. 
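
The result for the finite square step can be checked numerically. The following is a minimal sketch, not part of the course software: it evaluates the amplitude ratios $B/A$ and $C/A$ obtained from the two continuity conditions, confirms that $|B/A|^2 = 1$, and reports the length $1/\alpha$ over which $\psi(x)$ falls by a factor of e inside the step. The choice of an electron, the energies of 5 eV and 10 eV, the rounded constants and the function name `step_amplitudes` are all assumptions made for illustration.

```python
import math

HBAR = 1.055e-34   # J s (rounded value)
M_E = 9.11e-31     # kg, electron mass (the particle type is an assumption)
EV = 1.602e-19     # joules per electronvolt


def step_amplitudes(e0_ev, v0_ev, mass=M_E):
    """Return (B/A, C/A, alpha) for a finite square step with 0 < E0 < V0."""
    if not 0 < e0_ev < v0_ev:
        raise ValueError("this sketch only covers 0 < E0 < V0")
    k1 = math.sqrt(2 * mass * e0_ev * EV) / HBAR
    alpha = math.sqrt(2 * mass * (v0_ev - e0_ev) * EV) / HBAR
    b_over_a = (1j * k1 + alpha) / (1j * k1 - alpha)   # reflected amplitude
    c_over_a = 2j * k1 / (1j * k1 - alpha)             # amplitude inside the step
    return b_over_a, c_over_a, alpha


# Hypothetical illustration: 5 eV electrons meeting a 10 eV step.
b, c, alpha = step_amplitudes(5.0, 10.0)
reflection = abs(b) ** 2          # reflection coefficient R
penetration_depth = 1 / alpha     # metres; psi falls by a factor e over this length
print(reflection, penetration_depth)
```

For these illustrative values the penetration depth comes out a little under 0.1 nm, which is why barriers only a few tenths of a nanometre thick are needed before tunnelling becomes appreciable for electrons.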
Figure 7.17 The quantity $|\psi(x)|^2$ plotted against $x$ for a finite square step in the 
case $E_0 < V_0$. There is a finite probability that the particle will penetrate the step, 
although there is no possibility of tunnelling through it. 

Exercise 7.10 Show that the stationary-state probability density $|\psi(x)|^2$ in 
Region 1 of Figure 7.17 is a periodic function of $x$ with minima separated by $\pi/k_1$. 

Exercise 7.11 Show that the probability current in Region 2 of Figure 7.17 is zero. 


7.3.3 Stationary states and tunnelling in one dimension 


We will now use the stationary-state approach to analyze the tunnelling of 
particles of energy $E_0$ through a finite square barrier of width $L$ and height $V_0$ 
when $E_0 < V_0$ (see Figure 7.18). 

Figure 7.18 A finite square barrier of width $L$ and height $V_0$, together with the 
energy $E_0$ of each tunnelling particle. 

Our main aim will be to find an expression for the transmission coefficient. By 
now, you should be familiar with the general technique for dealing with problems 
of this kind, including the existence of exponentially growing and decaying 
solutions in the classically forbidden region, so we shall immediately write down 
the solution of the relevant time-independent Schrödinger equation: 

\[ \psi(x) = A e^{ik_1 x} + B e^{-ik_1 x} \quad \text{for } x < 0 \text{ (Region 1)}, \tag{7.49} \]
\[ \psi(x) = C e^{-\alpha x} + D e^{\alpha x} \quad \text{for } 0 < x < L \text{ (Region 2)}, \tag{7.50} \]
\[ \psi(x) = F e^{ik_1 x} \quad \text{for } x > L \text{ (Region 3)}, \tag{7.51} \]

where $A$, $B$, $C$, $D$ and $F$ are arbitrary constants and 

\[ k_1 = \frac{\sqrt{2mE_0}}{\hbar} \quad \text{while} \quad \alpha = \frac{\sqrt{2m(V_0 - E_0)}}{\hbar}. \]


Note that the term that might describe a leftward moving beam in Region 3 has 
already been omitted, and the wave number in Region 3 has been set equal to that 
in Region 1. 


Requiring the continuity of $\psi(x)$ and $d\psi/dx$ at $x = 0$ and at $x = L$ leads to the 
following four relations: 

\[ A + B = C + D, \]
\[ ik_1 A - ik_1 B = -\alpha C + \alpha D, \]
\[ C e^{-\alpha L} + D e^{\alpha L} = F e^{ik_1 L}, \]
\[ -\alpha C e^{-\alpha L} + \alpha D e^{\alpha L} = ik_1 F e^{ik_1 L}. \]

After some lengthy algebra, similar to that in Worked Example 7.1, these four 
equations can be reduced to a relationship between $F$ and $A$, from which it is 
possible to obtain the following expression for the transmission coefficient 
$T = |F/A|^2$: 

\[ T = \frac{4E_0(V_0 - E_0)}{4E_0(V_0 - E_0) + V_0^2 \sinh^2(\alpha L)}. \tag{7.52} \]

(Remember: $\sinh x = (e^{x} - e^{-x})/2$.) 

When $\alpha L \gg 1$, the denominator of this expression can be approximated by 
$\tfrac{1}{4} V_0^2 e^{2\alpha L}$, and the transmission coefficient through the barrier is well-described by 
the useful relationship 

\[ T \approx 16 \left( \frac{E_0}{V_0} \right) \left( 1 - \frac{E_0}{V_0} \right) e^{-2\alpha L}. \tag{7.53} \]


This shows the exponential behaviour that might have been expected on the basis 
of our earlier results for barrier penetration into a finite square step, but in this 
case it is a true tunnelling result. It tells us that tunnelling will occur, but indicates 
that the tunnelling probability will generally be rather small when $\alpha L \gg 1$, and 
will decrease rapidly as the barrier width $L$ increases. 

A graph of $|\psi|^2$ plotted against $x$ for a finite square barrier in the case $E_0 < V_0$ 
will look something like Figure 7.19. Note that because the incident and reflected 
beams have different intensities, the minimum value of $|\psi|^2$ in Region 1 is always 
greater than zero. Also note that for a square barrier of finite width the declining 
curve in Region 2 is not described by a simple exponential function; there are 
both exponentially decreasing and exponentially increasing contributions to $\psi(x)$ 
in that region. 

Figure 7.19 The quantity $|\psi|^2$ plotted against $x$ for a finite square barrier in the 
case $E_0 < V_0$. 
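
The 'lengthy algebra' that leads to Equation 7.52 can be checked numerically. The sketch below is an illustration under stated assumptions rather than the method used in the text: it sets $F = 1$, uses the two continuity relations at $x = L$ to find $C$ and $D$, then uses the two relations at $x = 0$ to find $A$, and finally compares $|F/A|^2$ with Equation 7.52. Because $E_0 \propto k_1^2$, $V_0 - E_0 \propto \alpha^2$ and $V_0 \propto k_1^2 + \alpha^2$, the formula is written in terms of $k_1$ and $\alpha$. The function names and the dimensionless test values are hypothetical.

```python
import cmath
import math


def transmission_from_matching(k1, alpha, width):
    """|F/A|^2 obtained directly from the four continuity relations."""
    f = 1.0 + 0j                                  # fix F = 1; only ratios matter
    rhs = f * cmath.exp(1j * k1 * width)          # F exp(i k1 L)
    # The two relations at x = L give C and D in terms of F.
    c = 0.5 * rhs * (1 - 1j * k1 / alpha) * math.exp(alpha * width)
    d = 0.5 * rhs * (1 + 1j * k1 / alpha) * math.exp(-alpha * width)
    # The two relations at x = 0 give A in terms of C and D.
    a = 0.5 * ((c + d) + (1j * alpha / k1) * (c - d))
    return abs(f / a) ** 2


def transmission_formula(k1, alpha, width):
    """Equation 7.52 rewritten using E0 ~ k1^2 and V0 - E0 ~ alpha^2."""
    num = 4 * k1 ** 2 * alpha ** 2
    return num / (num + (k1 ** 2 + alpha ** 2) ** 2 * math.sinh(alpha * width) ** 2)


# Dimensionless illustrative values (assumed, not taken from the text).
t_matching = transmission_from_matching(1.3, 0.7, 2.0)
t_formula = transmission_formula(1.3, 0.7, 2.0)
print(t_matching, t_formula)
```

The two numbers should agree to within rounding error for any positive $k_1$, $\alpha$ and $L$, which is a useful check on the signs in the continuity relations.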


Worked Example 7.2 

Electrons with a kinetic energy of 5eV are incident upon a finite square 
barrier with a height of 10 eV and a width of 0.5nm. Estimate the value of 
T and hence the probability that any particular electron will tunnel through 
the barrier. 

Solution 

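In this case, $V_0 - E_0 = (10 - 5)\,\text{eV} = 5 \times 1.6 \times 10^{-19}\,\text{J}$. It follows that 

\[ \alpha = \frac{\sqrt{2m(V_0 - E_0)}}{\hbar} = \frac{(2 \times 9.11 \times 10^{-31}\,\text{kg} \times 5 \times 1.6 \times 10^{-19}\,\text{J})^{1/2}}{1.06 \times 10^{-34}\,\text{J s}} = 1.14 \times 10^{10}\,\text{m}^{-1}. \]

With $L = 5 \times 10^{-10}\,\text{m}$, it follows that 
$\alpha L = 1.14 \times 10^{10}\,\text{m}^{-1} \times 5 \times 10^{-10}\,\text{m} = 5.7$. 

Since this is much larger than 1, we can use Equation 7.53 to estimate 

\[ T \approx 16 \times \frac{1}{2} \times \left( 1 - \frac{1}{2} \right) e^{-11.4} \approx 4 \times 10^{-5}. \]

This is the probability that any particular electron will tunnel through the 
barrier. (About one electron in 25 000 will pass through.) 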
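
The arithmetic of Worked Example 7.2 is easy to reproduce. The following is a minimal sketch, assuming rounded values of the physical constants and a hypothetical function name `barrier_transmission`; it evaluates both the full expression of Equation 7.52 and the wide-barrier approximation of Equation 7.53, so that the two can be compared.

```python
import math

HBAR = 1.055e-34   # J s (rounded value)
M_E = 9.11e-31     # kg, electron mass
EV = 1.602e-19     # joules per electronvolt


def barrier_transmission(e0_ev, v0_ev, width_m, mass=M_E):
    """Return (alpha*L, T from Eq. 7.52, T from Eq. 7.53) for E0 < V0."""
    alpha = math.sqrt(2 * mass * (v0_ev - e0_ev) * EV) / HBAR
    ratio = e0_ev / v0_ev
    # Equation 7.52 divided through by 4 E0 (V0 - E0).
    exact = 1 / (1 + math.sinh(alpha * width_m) ** 2 / (4 * ratio * (1 - ratio)))
    # Equation 7.53, valid when alpha * L is much larger than 1.
    approx = 16 * ratio * (1 - ratio) * math.exp(-2 * alpha * width_m)
    return alpha * width_m, exact, approx


# The values of Worked Example 7.2: 5 eV electrons, 10 eV barrier, 0.5 nm wide.
alpha_l, t_exact, t_approx = barrier_transmission(5.0, 10.0, 0.5e-9)
print(alpha_l, t_exact, t_approx)
```

Both expressions give a transmission coefficient of a few times $10^{-5}$, and they differ from each other by far less than the uncertainty introduced by rounding the constants. Doubling the barrier width in the call above shows how rapidly $T$ collapses as $L$ increases.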


Exercise 7.12 Find expressions for the probability current in each of the three 
regions in the case of tunnelling through a finite square barrier. Comment on the 
significance of your result for Region 2. 


Essential skill: Evaluating transmission coefficients for wide barriers 


7.3.4 Simulations using stationary states 


Now would be a good time to study the computer package Stationary states 
for tunnelling and scattering in one dimension. The package solves the 
time-independent Schrödinger equation for a variety of one-dimensional potential 
energy functions and for a range of energy eigenvalues $E_0$. It displays the 
results graphically by plotting the real and imaginary parts of $\psi(x)$ together with 
$|\psi(x)|^2$. For a given potential energy function, the program will also plot the 
corresponding transmission coefficient. 


For finite square barriers of specified width L and height Vo, you can choose 
values of Eo that are less than Vo as well as values that are greater than Vo, so you 
can study the transition from scattering to tunnelling. The package also allows 
you to study scattering from steps and wells. 


This simple but powerful program can provide significant insight into the 
stationary-state approach to tunnelling and scattering in one dimension. It 
will reward detailed study, but at this stage you should only spend about half 
an hour on it. On-screen instructions will guide you through the activity, but 
you should especially note the following points: 


• For $E_0 < V_0$, the strong dependence of the transmission coefficient $T$ on 
the height and width of the barrier. 

• For $E_0 > V_0$, the appearance of transmission resonances and their effect 
on $|\psi(x)|^2$. 

• For a narrow square barrier of fixed height $V_0$, the effect of gradually 
reducing the incident particle energy $E_0$ from a value far above $V_0$ to 
one that is far below. 

• The differences in the graphs of $|\psi(x)|^2$ (especially in wavelength and 
amplitude) for scattering from wells and barriers. 


7.4 Applications of tunnelling 


The discovery that quantum mechanics permits the tunnelling of particles was 

of great significance. It has deep implications for our understanding of the 
physical world and many practical applications, particularly in electronics and the 
developing field of nanotechnology. This section introduces some of these 
implications and applications. Applications naturally involve the three dimensions 
of the real world, and realistic potential energy functions are never perfectly 
square. Despite these added complexities, the principles developed in the last 
section provide a good basis for the discussion that follows. 


7.4.1 Alpha decay 


Chapter 1 discussed the law of radioactive decay. You saw that, given a sample 
of $N(0)$ similar nuclei at time $t = 0$, the number remaining at time $t$ is 
$N(t) = N(0)e^{-\lambda t}$, where $\lambda$, the decay constant for a particular kind of nucleus, 
determines the rate at which the nuclei decay. The half-life is the time needed for 
half of any sufficiently large sample to decay. It is related to the decay constant by 
$T_{1/2} = (\ln 2)/\lambda$. 
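
The relation between the half-life and the decay constant follows in a few lines from the decay law. As a brief reminder, setting the number of surviving nuclei equal to half the initial number gives:

```latex
\begin{align*}
N(T_{1/2}) &= \tfrac{1}{2} N(0) \\
N(0)\, e^{-\lambda T_{1/2}} &= \tfrac{1}{2} N(0) \\
e^{\lambda T_{1/2}} &= 2 \\
T_{1/2} &= \frac{\ln 2}{\lambda}
\end{align*}
```

A large decay constant therefore corresponds to a short half-life, and vice versa.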


We shall now consider an important type of radioactive decay called alpha decay 
in which an atomic nucleus emits an energetic alpha particle. The emitted alpha 
particle consists of two protons and two neutrons and is structurally identical to a 
helium-4 nucleus ($^{4}_{2}\text{He}$). Alpha decay is the dominant decay mode for a number of 
heavy nuclei (typically those with atomic numbers greater than 84); a specific 
example is the decay of uranium to thorium represented by 

\[ ^{238}_{92}\text{U} \rightarrow {}^{234}_{90}\text{Th} + \alpha, \]

where $\alpha$ denotes the alpha particle. Note that the atomic number of the parent 
nucleus decreases by two and its mass number decreases by four. 


Alpha decay was discovered and named by Rutherford in 1898. It was soon 
established that each type of alpha-decaying nucleus emits an alpha particle 
with a characteristic energy, $E_\alpha$. While these alpha emission energies cover a 
fairly narrow range of values (from about 2 MeV to 8 MeV), the half-lives of 
the corresponding nuclei cover an enormous range (from $10^{-7}$ s to $10^{17}$ s). 
Experiments showed that, within certain families of alpha-emitting nuclei, the 
half-lives and alpha emission energies were related to one another. Written in 
terms of the decay constant, $\lambda = (\ln 2)/T_{1/2}$, this relationship can be expressed in 
the form 

\[ \lambda = A e^{-B/E_\alpha^{1/2}}, \tag{7.54} \]

where $A$ is a constant that characterizes the particular family of nuclei, and $B$ 
depends on the charge of the individual nucleus. We shall refer to this empirical 
law as the Geiger–Nuttall relation. 


Despite all this information, by the early 1920s alpha decay had become a major 
puzzle to physicists. The cause of the observed Geiger–Nuttall relation was not 
understood. Attempts to explain it on the basis of classical physics, with the alpha 
particle initially confined within the nucleus by an energy barrier that it eventually 
manages to surmount, did not work. In some cases the observed emission 
energies were too low to be consistent with surmounting the energy barrier at all. 
So, how could the alpha particles escape, why did their emission lead to such a 
staggering range of half-lives, and what was the origin of the Geiger–Nuttall 
relation? 


Answering these questions was one of the early triumphs of wave mechanics. In 
1928 the Russian-American physicist George Gamow, and then, independently, 
Gurney and Condon, proposed a successful theory of alpha decay based on 
quantum tunnelling. In a simplified version of their approach, the potential energy 
function responsible for alpha decay has the form shown in Figure 7.20. 

Figure 7.20 The potential energy function $V(r)$ for an alpha particle in the 
vicinity of an atomic nucleus of atomic number $Z$. 


Note that V(r) is a function of a radial coordinate r; this is because we are 
dealing with a three-dimensional problem in which the potential energy function 
is spherically symmetric, and $r$ represents the distance from an origin at the centre 
of the nucleus. Initially, an alpha particle of energy $E_\alpha$ is confined within a 
distance $r = r_0$ of the origin by the well-like part of the potential energy function. 
This well is due to the powerful but short-range interaction known as the strong 
nuclear force. In addition, a long-range electrostatic force acts between the 
positively-charged alpha particle and the remainder of the positively-charged 
nucleus, and has the effect of repelling the alpha particle from the nucleus. The 
electrostatic force corresponds to the potential energy function 

\[ V(r) = \frac{2(Z - 2)e^2}{4\pi\varepsilon_0 r}, \tag{7.55} \]

where $Z$ is the atomic number of the nucleus, $2e$ is the charge of the alpha 
particle, $(Z - 2)e$ is the charge of the nucleus left behind after the decay and $\varepsilon_0$ is 
a fundamental constant called the permittivity of free space. This potential 
energy function is often called the Coulomb barrier. Notice that the Coulomb 
barrier exceeds the energy of the alpha particle in the region between $r = r_0$ and 
$r = r_1$ in Figure 7.20 (where $V(r_1) = E_\alpha$). In classical physics, the alpha 
particle does not have enough energy to enter this region, but in quantum physics 
it may tunnel through. Once beyond the point $r = r_1$, the alpha particle is 
electrostatically repelled from the nucleus. 


To apply the quantum-mechanical theory of tunnelling to alpha decay, we first 
note that a classically confined particle would oscillate back and forth inside the 
well; the combination of its energy ($E_\alpha$) and the nuclear diameter ($2r_0$) implying 
that it is incident on the barrier about $10^{21}$ times per second. Taking this idea over 
into quantum mechanics, we shall regard each of these encounters as an escape 
attempt. The small probability of escape at each attempt is represented by the 
transmission coefficient for tunnelling, T. To estimate T we must take account of 
the precise shape of the Coulomb barrier. We shall not go through the detailed 
arguments used to estimate T in this case, but we shall note that they involve the 
approximation 


\[ T \approx \exp\left( -2 \int_{r_0}^{r_1} \frac{\sqrt{2m(V(r) - E_\alpha)}}{\hbar}\, dr \right), \tag{7.56} \]

where $r_0$ and $r_1$ are the minimum and maximum values of $r$ for which 
$V(r) > E_\alpha$. Equation 7.56 is closely related to the expression for tunnelling 
through a finite square barrier given in Section 7.3.3. If the potential energy 
function V (r) happened to be constant over a region of length L, then 

Equation 7.56 would reproduce the exponential term of Equation 7.53. The other 
factors in Equation 7.53 are not reproduced, but they vary so slowly compared 
with the exponential factor that they can be ignored for present purposes. 


For given values of $E_\alpha$, $Z$ and $r_0$, Equation 7.56 can be evaluated using the 
Coulomb barrier potential energy function of Equation 7.55. After a lengthy 
calculation, including some approximations, the final result is of the form 

\[ T \approx a\, e^{-b(Z - 2)/E_\alpha^{1/2}}, \tag{7.57} \]

where $a$ and $b$ are constants. Multiplying $T$ by the number of escape attempts per 
second gives the rate of successful escape attempts, and this can be equated to the 
decay constant, $\lambda$. So, according to the quantum tunnelling theory of alpha decay, 
we have 

\[ \lambda \propto e^{-b(Z - 2)/E_\alpha^{1/2}}. \tag{7.58} \]

This agrees with the Geiger–Nuttall relation and a detailed comparison with 
experimental data is shown in Figure 7.21. The exponential dependence on $E_\alpha^{-1/2}$ 
implies that a very wide range of decay constants is associated with a small range 
of emission energies. The sensitivity to energy is far greater than for a square 
barrier because of the shape of the Coulomb barrier; increasing the energy of the 
alpha particle decreases the effective width that must be tunnelled through. 
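
The extreme sensitivity to $E_\alpha$ can be seen by evaluating the integral in Equation 7.56 numerically for the Coulomb barrier of Equation 7.55. The sketch below is only an illustration: it uses the midpoint rule, works in units of MeV and fm, and assumes rounded values for $\hbar c$, for the rest energy of the alpha particle and for $e^2/4\pi\varepsilon_0$. The nuclear radius $r_0 = 9$ fm, the choice $Z = 92$ and the function names are assumptions, not values taken from the text.

```python
import math

HBAR_C = 197.327       # MeV fm (rounded)
ALPHA_MASS = 3727.38   # MeV, rest energy of an alpha particle (rounded)
COULOMB = 1.440        # MeV fm, value of e^2 / (4 pi eps0) (rounded)


def coulomb_barrier(r_fm, z):
    """Equation 7.55 in MeV, with r measured in fm."""
    return 2 * (z - 2) * COULOMB / r_fm


def tunnelling_probability(e_alpha_mev, z, r0_fm, steps=20000):
    """Equation 7.56 evaluated with the midpoint rule."""
    r1 = 2 * (z - 2) * COULOMB / e_alpha_mev   # outer radius, where V(r1) = E_alpha
    if r1 <= r0_fm:
        return 1.0                             # no classically forbidden region
    h = (r1 - r0_fm) / steps
    total = 0.0
    for i in range(steps):
        r = r0_fm + (i + 0.5) * h              # midpoints always lie inside the barrier
        excess = coulomb_barrier(r, z) - e_alpha_mev
        total += math.sqrt(2 * ALPHA_MASS * excess) / HBAR_C * h
    return math.exp(-2 * total)


# Illustrative (assumed) parameters: Z = 92 and r0 = 9 fm.
t_low = tunnelling_probability(4.0, 92, 9.0)
t_high = tunnelling_probability(8.0, 92, 9.0)
print(t_low, t_high, t_high / t_low)
```

With these assumed parameters, doubling the emission energy changes $T$, and hence the decay constant, by many orders of magnitude, which is the behaviour summarized by the Geiger–Nuttall relation.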


Figure 7.21 A comparison of the Geiger–Nuttall relation with experimental data 
for different families of nuclei, plotted as $\log(T_{1/2}/\text{s})$ against $(E_\alpha/\text{MeV})^{-1/2}$. In this 
plot, the straight lines confirm the exponential dependence of $T_{1/2}$ (and hence $\lambda$) 
on $E_\alpha^{-1/2}$. 


7.4.2 Stellar astrophysics 


If tunnelling out of nuclei is possible then so is tunnelling in! As a consequence it 
is possible to trigger nuclear reactions with protons of much lower energy than 
would be needed to climb over the full height of the Coulomb barrier. This was 
the principle used by J. D. Cockcroft and E. T. S. Walton in 1932 when they 
caused lithium-7 nuclei to split into pairs of alpha particles by bombarding them 
with high-energy protons. Their achievement won them the 1951 Nobel prize for 
physics. The same principle is also at work in stars, such as the Sun, where it 
facilitates the nuclear reactions that allow the stars to shine. Indeed, were it not 
for the existence of quantum tunnelling, it’s probably fair to say that the Sun 
would not shine and that life on Earth would never have arisen. 


The nuclear reactions that allow stars to shine are predominantly fusion reactions 
in which low mass nuclei combine to form a nucleus with a lower mass than the 
total mass of the nuclei that fused together to form it. It is the difference between 
the total nuclear masses at the beginning and the end of the fusion process that 
(via $E = mc^2$) is ultimately responsible for the energy emitted by a star. The 
energy released by each individual fusion reaction is quite small, but in the hot 
dense cores of stars there are so many fusing nuclei that they collectively account 
for the prodigious energy output that is typical of stars ($3.8 \times 10^{26}$ W in the case 
of the Sun). 


In order to fuse, two nuclei have to overcome the repulsive Coulomb barrier that 
tends to keep them apart. The energy they need to do this is provided by the 
kinetic energy associated with their thermal motion. This is why the nuclear 
reactions are mainly confined to a star’s hot central core. In the case of the Sun, 
the core temperature is of the order of $10^7$ K. Multiplying this by the Boltzmann 
constant indicates that the typical thermal kinetic energy of a proton in the solar 
core is about $1.4 \times 10^{-16}$ J $\approx$ 1 keV. However, the height of the Coulomb barrier 
between two protons is more than a thousand times greater than this. Fortunately, 
as you have just seen, the protons do not have to climb over this barrier because 
they can tunnel through it. Even so, and despite the hectic conditions of a stellar 
interior, where collisions are frequent and above-average energies not uncommon, 
the reliance on tunnelling makes fusion a relatively slow process. 
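
The mismatch between thermal energies and the Coulomb barrier can be checked with a few lines of arithmetic. This is a rough sketch under stated assumptions: the height of the barrier between two protons is estimated from the Coulomb potential energy at a separation of 1 fm, which is an assumed, order-of-magnitude figure for the range at which the strong nuclear force takes over. The constants are rounded and the variable names are hypothetical.

```python
K_B = 1.381e-23          # J/K, Boltzmann constant (rounded)
EV = 1.602e-19           # joules per electronvolt
COULOMB_MEV_FM = 1.440   # MeV fm, value of e^2 / (4 pi eps0) (rounded)

core_temperature = 1e7   # K, the order of magnitude quoted for the solar core
thermal_energy_kev = K_B * core_temperature / EV / 1e3

separation_fm = 1.0      # assumed separation at which the protons 'touch'
barrier_height_kev = COULOMB_MEV_FM / separation_fm * 1e3

ratio = barrier_height_kev / thermal_energy_kev
print(thermal_energy_kev, barrier_height_kev, ratio)
```

The thermal energy comes out a little below 1 keV, while the barrier is of order 1 MeV, so the ratio comfortably exceeds a thousand, in line with the statement above.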


Again taking the Sun as an example, its energy comes mainly from a process 
called the proton-proton chain that converts hydrogen to helium. The first step 
in this chain involves the fusion of two protons and is extremely slow, taking 
about $10^9$ years for an average proton in the core of the Sun. This is one of the 
reasons why stars are so long-lived. The Sun is about $4.6 \times 10^9$ years old, 
yet it has only consumed about half of the hydrogen in its core. So, we have 
quantum tunnelling to thank, not only for the existence of sunlight, but also for its 
persistence over billions of years. 


7.4.3 The scanning tunnelling microscope 


The scanning tunnelling microscope (STM) is a device of such extraordinary 
sensitivity that it can reveal the distribution of individual atoms on the surface of a 
sample. It can also be used to manipulate atoms and even to promote chemical 
reactions between specific atoms. The first STM was developed in 1981 at 

the IBM Laboratories in Zurich by Gerd Binnig and Heinrich Rohrer. Their 
achievement was recognized by the award of the 1986 Nobel prize for physics. 


In an STM the sample under investigation is held in a vacuum and a very fine tip, 
possibly only a single atom wide, is moved across its surface (see Figure 7.22). 
Things are so arranged that there is always a small gap between the tip and the 
surface being scanned. An applied voltage between the tip and the sample tends to 
cause electrons to cross the gap, but the gap itself constitutes a potential energy 
barrier that, classically, the electrons would not be able to surmount. However, 
thanks to quantum physics, they can tunnel through the barrier and thereby 
produce a measurable electric current. Since the current is caused by a tunnelling 
process, the magnitude of the current is very sensitive to the size of the gap 
(detailed estimates can again be obtained using Equation 7.56). This sensitivity is 
the key to finding the positions of tiny irregularities in the surface, including 
individual atoms. 
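
The sensitivity of the tunnelling current to the size of the gap can be estimated by treating the gap as a square barrier, for which the exponential factor in the transmission coefficient is $e^{-2\alpha L}$. The sketch below is illustrative only: the effective barrier height of 4 eV is an assumed, typical figure rather than a value from the text, the constants are rounded, and the function name `current_ratio` is hypothetical.

```python
import math

HBAR = 1.055e-34   # J s (rounded value)
M_E = 9.11e-31     # kg, electron mass
EV = 1.602e-19     # joules per electronvolt


def current_ratio(barrier_ev, gap_increase_m):
    """Factor by which exp(-2 alpha L) changes when the gap widens."""
    alpha = math.sqrt(2 * M_E * barrier_ev * EV) / HBAR
    return math.exp(-2 * alpha * gap_increase_m)


# Assumed effective barrier of 4 eV; the gap widens by 0.1 nm,
# which is roughly the size of an atom.
ratio_for_one_angstrom = current_ratio(4.0, 0.1e-9)
print(ratio_for_one_angstrom)
```

Under these assumptions, increasing the gap by about the diameter of a single atom reduces the tunnelling current by nearly an order of magnitude, which is why atomic-scale features show up so clearly.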




Figure 7.22 (a) A scanning tunnelling microscope (STM) surrounded by 
vacuum chambers. (b) A schematic diagram showing an STM tip in operation. 


In practice, the STM can operate in two different ways. In constant-height mode, 
the tip moves at a constant height and the topography of the surface is revealed by 
changes in the tunnelling current. In the more common constant-current mode the 
height of the tip is adjusted throughout the scanning process to maintain a 
constant current and the tiny movements of the tip are recorded. In either mode 
the structure of the sample’s surface can be mapped on an atomic scale, though 
neither mode involves imaging of the kind that takes place in a conventional 
optical or transmission electron microscope. 


STMs have now become a major tool in the developing field of nanotechnology. 
This is partly because of the images they supply, but even more because of their 
ability to manipulate individual atoms, and position them with great accuracy. 
One of the products of this kind of nano-scale manipulation is shown in 
Figure 7.23, the famous ‘quantum corral’ formed by positioning iron atoms on a 
copper surface. 

Figure 7.23 Iron atoms on a copper surface forming a ring called a ‘quantum 
corral’. Standing waves of surface electrons trapped inside the corral are also 
visible. 


You can learn more about the science and technology of the STM by 
studying the multimedia package The scanning tunnelling microscope. Now 
would be the ideal time to do so. Note that the package contains a computer 
simulation and a video. 


Summary of Chapter 7 


Introduction Scattering is a process in which incident particles interact with a 
target and are changed in nature, number, speed or direction of motion as a result. 
Tunnelling is a quantum phenomenon in which particles that are incident on a 
classically impenetrable barrier are able to pass through the barrier and emerge on 
the far side of it. 


Section 7.1 In one dimension, wave packets scattered by finite square barriers or 
wells generally split into transmitted and reflected parts, indicating that there are 
non-zero probabilities of both reflection and transmission. These probabilities are 
represented by the reflection and transmission coefficients R and T. The values of 
$R$ and $T$ generally depend on the nature of the target and the properties of the 
incident particles. If there is no absorption, creation or destruction of particles, 
R+T=1. 


Section 7.2 Unnormalizable stationary-state solutions of Schrödinger’s equation 
can be interpreted in terms of steady beams of particles. A term such as 
$A e^{i(kx - \omega t)}$ can be associated with a beam of linear number density $n = |A|^2$ 
travelling with speed $v = \hbar k/m$ in the direction of increasing $x$. Such a beam has 
intensity $j = nv$. In this approach, $T = j_{\text{trans}}/j_{\text{inc}}$ and $R = j_{\text{ref}}/j_{\text{inc}}$. 


For particles of energy $E_0 > V_0$, incident on a finite square step of height $V_0$, the 
transmission coefficient is 

\[ T = \frac{4k_1 k_2}{(k_1 + k_2)^2}, \]

where 

\[ k_1 = \frac{\sqrt{2mE_0}}{\hbar} \quad \text{and} \quad k_2 = \frac{\sqrt{2m(E_0 - V_0)}}{\hbar} \]

are the wave numbers of the incident and transmitted beams. For a finite square 
well or barrier of width $L$, the transmission coefficient can be expressed as 

\[ T = \frac{4E_0(E_0 \pm V_0)}{4E_0(E_0 \pm V_0) + V_0^2 \sin^2(k_2 L)}, \]

where $k_2 = \sqrt{2m(E_0 \pm V_0)}/\hbar$, with the plus signs being used for a well and the 
minus signs for a barrier. Transmission resonances, at which $T = 1$ and the 
transmission is certain, occur when $k_2 L = N\pi$ where $N$ is an integer. 


Travelling wave packets and steady beams of particles can both be thought of as 
representing flows of probability. In one dimension such a flow is described by the 
probability current 


\[ j_x(x, t) = -\frac{i\hbar}{2m} \left( \Psi^* \frac{\partial \Psi}{\partial x} - \Psi \frac{\partial \Psi^*}{\partial x} \right). \]


In three dimensions, scattering is described by the total cross-section, $\sigma$, which is 
the rate at which scattered particles emerge from the target per unit time per unit 
incident flux. For any chosen direction, the differential cross-section tells us the 
rate of scattering into a small cone of angles around that direction. At very high 
energies, total cross-sections are dominated by inelastic effects due to the creation 
of new particles. 




Section 7.3 Wave packets with a narrow range of energies centred on $E_0$ can 
tunnel through a finite square barrier of height $V_0 > E_0$. In a stationary-state 
approach, solutions of the time-independent Schrödinger equation in the 
classically forbidden region contain exponentially growing and decaying terms of 
the form $Ce^{-\alpha x}$ and $De^{\alpha x}$, where $\alpha = \sqrt{2m(V_0 - E_0)}/\hbar$ is the attenuation 
coefficient. The transmission coefficient for tunnelling through a finite square 
barrier of width $L$ and height $V_0$ is approximately 

\[ T \approx 16 \left( \frac{E_0}{V_0} \right) \left( 1 - \frac{E_0}{V_0} \right) e^{-2\alpha L} \quad \text{provided that } \alpha L \gg 1. \]
0 


Such a transmission probability is small and decreases rapidly as the barrier width 
L increases. 


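Section 7.4 Square barriers and wells are poor representations of the potential 
energy functions found in Nature. However, if the potential $V(x)$ varies smoothly 
as a function of $x$, the transmission coefficient for tunnelling of particles of energy 
$E_0$ can be roughly represented by 

\[ T \approx \exp\left( -2 \int_{x_0}^{x_1} \frac{\sqrt{2m(V(x) - E_0)}}{\hbar}\, dx \right), \]

where the integral extends over the classically forbidden region, in which 
$V(x) > E_0$. 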


This approximation can be used to provide a successful theory of nuclear alpha 
decay as a tunnelling phenomenon. It can also account for the occurrence of 
nuclear fusion in stellar cores, despite the relatively low temperatures there. In 
addition, it explains the operation of the scanning tunnelling microscope which 
can map surfaces on the atomic scale. 


Achievements from Chapter 7 


After studying this chapter you should be able to: 
7.1 Explain the meanings of the newly defined (emboldened) terms and 
symbols, and use them appropriately. 


7.2 Describe the behaviour of wave packets when they encounter potential 
energy steps, barriers and wells. 


7.3 Describe how stationary-state solutions of the Schrödinger equation can be 
used to analyze scattering and tunnelling. 


7.4 For a range of simple potential energy functions, obtain the solution of the 
time-independent Schrödinger equation and use continuity boundary 
conditions to find reflection and transmission coefficients. 


7.5 Present information about solutions of the time-independent Schrödinger 
equation in graphical terms. 


7.6 Evaluate probability density currents and explain their significance. 


7.7 Describe and comment on applications of scattering and tunnelling in a 
range of situations including: three-dimensional scattering, alpha decay, 
nuclear fusion in stars, and the scanning tunnelling microscope. 


Introduction 


This mathematical toolkit provides support for some mathematical topics you will 
meet in the other chapters of the book. It deals with basic techniques involving 
complex numbers, ordinary and partial differential equations and probability. This 
is important material that you will need to know in order to progress freely with 
the physics of wave mechanics. 


8.1 Complex numbers 


This section supports all the chapters in this book and is best studied in 
conjunction with … 

In this section, ordinary numbers, stretching continuously from $-\infty$ to $+\infty$, are 
called real numbers. Then a complex number is something that can be written in 
the form 

\[ z = x + iy, \tag{8.1} \]

where $x$ and $y$ are real numbers and $i$ is a special quantity with the property that 
$i^2 = -1$. We shall refer to $i$ as the square root of minus 1. Complex numbers of 
the form $iy$ are called imaginary numbers. Real numbers, such as 2.7 or 4.8 are 
also special cases of complex numbers, with $y = 0$. 


Each complex number z = x + iy has a real part, Re(z) = x, and an imaginary 
part, Im(z) = y. Note that the imaginary part of a complex number is the real 
number y, not the imaginary number iy. Two complex numbers z and w are equal 
if, and only if, Re(z) = Re(w) and Im(z) = Im(w). 


We also define the modulus of the complex number z = x + iy to be 
\[ |z| = \sqrt{x^2 + y^2}. \tag{8.2} \]


This is a real, non-negative quantity. 
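
If you have access to Python, these definitions are easy to explore, because the language has a built-in complex type in which the imaginary unit is written `j` rather than $i$. The following short sketch uses an arbitrarily chosen number, $5 - 12i$, purely for illustration.

```python
z = complex(5, -12)    # the complex number 5 - 12i (an arbitrary example)

real_part = z.real     # Re(z)
imag_part = z.imag     # Im(z); a real number, not an imaginary one
modulus = abs(z)       # |z| = sqrt(x**2 + y**2), as in Equation 8.2

print(real_part, imag_part, modulus)   # prints 5.0 -12.0 13.0

# Two complex numbers are equal only if both parts agree.
same = complex(5, -12) == z
different = complex(5, 12) == z
```

Note that `z.imag` returns the real number $-12$, in keeping with the convention that the imaginary part of $z = x + iy$ is $y$ rather than $iy$.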


Exercise 8.1 What are the real part, the imaginary part and the modulus of the 
complex number $3 - 4i$? 


When you first meet complex numbers, you may need to suspend your disbelief; it 
is clear that no real number can play the role of i, because the square of any real 
number is non-negative. The traditional terminology of ‘imaginary numbers’ 
scarcely helps since it seems to warn us that we are entering a mystical world that 
has nothing to do with reality. This is unfair and untrue. Admittedly, complex 
numbers do not play the role of ordinary numbers in counting and measuring 
things; no one has 3 + 2i eggs for breakfast! However, complex numbers can 
describe other aspects of the world we live in. 


To take a simple example, suppose that we associate +1 with looking East and −1 
with looking West; then looking North may be represented by the complex 
number i, and looking South by the complex number −i. The fact that $i^2 = -1$ 
expresses the fact that the 90° rotation that converts looking East into looking 
North can be repeated twice in a row, and the end-result will be looking West. 
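
The compass picture can be tried out directly. The following is a minimal sketch, assuming the hypothetical names `EAST`, `NORTH`, `WEST`, `SOUTH` and `turn_left`; it simply treats a 90° anticlockwise turn as multiplication by i.

```python
# Headings represented as complex numbers, following the example in the text.
EAST, NORTH, WEST, SOUTH = 1 + 0j, 1j, -1 + 0j, -1j

def turn_left(direction: complex) -> complex:
    """Rotate a heading anticlockwise by 90 degrees by multiplying by i."""
    return 1j * direction

print(turn_left(EAST) == NORTH)              # one turn: East becomes North
print(turn_left(turn_left(EAST)) == WEST)    # two turns: i * i = -1, so East becomes West
```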
Complex numbers sometimes appear in classical physics, but always in 
an auxiliary role, to help simplify calculations. The situation is quite 
different in quantum physics. Basic equations of quantum physics (including 
Schrödinger's equation) involve complex numbers in an essential way and 
the quantum-mechanical wave function, Ψ(x, t), has complex values. This 
wave function cannot itself be measured, but it is used to calculate real-valued 
quantities that can be measured. So the situation in quantum physics is rather 
unusual: complex numbers are unavoidable because they appear in fundamental 
equations, but real numbers are eventually extracted from the formalism so that 
quantum-mechanical predictions can be compared with real measured values. 


8.1.1 The arithmetic of complex numbers 


It is easy to carry out arithmetic using complex numbers. All the normal rules of 
arithmetic apply, with the additional rule that i is treated as an algebraic symbol 
with the special property that $i^2 = -1$. For example, we can add or subtract 
two complex numbers by adding or subtracting their real and imaginary parts 
separately: 

$$ (2 + 3i) + (1 + 4i) = (2 + 1) + (3i + 4i) = 3 + 7i, $$
$$ (2 + 3i) - (1 + 4i) = (2 - 1) + (3i - 4i) = 1 - i. $$


When multiplying complex numbers we simply need to remember to replace 
every occurrence of $i^2$ by −1. For example, 

$$ (2 + 3i)(1 + 4i) = (2)(1) + (3i)(4i) + (2)(4i) + (3i)(1) = 2 - 12 + 8i + 3i = -10 + 11i. $$


It is slightly harder to take the reciprocal of a complex number. To evaluate 
1/(x + iy), we multiply by the fraction (x − iy)/(x − iy) which, of course, is 
equal to 1: 

$$ \frac{1}{x + iy} = \frac{1}{x + iy} \times \frac{x - iy}{x - iy}. \tag{8.3} $$

The advantage of doing this is that the denominator simplifies to a real number: 

$$ (x + iy)(x - iy) = x^2 - (iy)(iy) = x^2 + y^2, $$

so we have 

$$ \frac{1}{x + iy} = \frac{x - iy}{x^2 + y^2}, $$

which is a complex number with real part $x/(x^2 + y^2)$ and imaginary part 
$-y/(x^2 + y^2)$. 

The quotient of any two complex numbers can be found in a similar way: 

$$ \frac{a + ib}{c + id} = \frac{a + ib}{c + id} \times \frac{c - id}{c - id} = \frac{ac + bd}{c^2 + d^2} + i\,\frac{bc - ad}{c^2 + d^2}. $$
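
The quotient rule above can be checked with exact rational arithmetic. This is only a sketch: the function name `divide` and the sample numbers (2 + 3i)/(1 + 4i) are assumptions made for illustration, not taken from the text.

```python
from fractions import Fraction

def divide(a, b, c, d):
    """Return (real part, imaginary part) of (a + ib)/(c + id).

    Top and bottom are multiplied by c - id, so the denominator becomes
    the real number c^2 + d^2.
    """
    denom = Fraction(c * c + d * d)
    real_part = Fraction(a * c + b * d) / denom
    imag_part = Fraction(b * c - a * d) / denom
    return real_part, imag_part

real_part, imag_part = divide(2, 3, 1, 4)
print(real_part, imag_part)   # 14/17 -5/17
```

Using `Fraction` keeps the result exact, which makes it easy to compare with a hand calculation; Python's own complex division `complex(2, 3) / complex(1, 4)` gives the same value in floating point.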


Exercise 8.2 Given that z₁ = 1 + 2i and z₂ = 4 − 3i, find: (a) z₁ + z₂, 
(b) 2z₁ − 3z₂, (c) z₁z₂ and (d) z₁/z₂. 


Figure 8.1 The complex plane or Argand diagram. [Axes: real axis (x-axis) and imaginary axis (y-axis); the point (x, y) is marked.] 


8.1.2 Complex conjugation 


The complex conjugate of a complex number z = x + iy is the complex number 
z* obtained by reversing the sign of i. Thus, 

$$ z^* = x - iy. $$


More generally, the complex conjugate of any expression involving complex 
numbers is obtained by reversing the sign of i throughout the expression, or 
equivalently, by replacing each part of the expression by its complex conjugate. 
This means, for example, that 


be\* re 
—) =a 8.4 
(o Po ) aE a (8.4) 
Two successive sign-reversals have no net effect, so we always have 

$$ (z^*)^* = z. $$


Complex conjugates are useful in finding the real and imaginary parts of complex 
expressions. Since 

$$ z = \mathrm{Re}(z) + i\,\mathrm{Im}(z) \quad\text{and}\quad z^* = \mathrm{Re}(z) - i\,\mathrm{Im}(z), $$

we have 

$$ \mathrm{Re}(z) = \frac{z + z^*}{2} \quad\text{and}\quad \mathrm{Im}(z) = \frac{z - z^*}{2i}. $$

These equations provide useful tests for whether complex expressions are real or 
imaginary. If an expression z is real, it has Im(z) = 0, and is characterized by 
z* = z. By contrast, an imaginary expression has Re(z) = 0, and is characterized 
by z* = −z. 


From Equation 8.3, we also have 

$$ zz^* = (x + iy)(x - iy) = x^2 + y^2 = |z|^2, \tag{8.5} $$

so the modulus of any complex expression z is given by $|z| = \sqrt{zz^*}$. 

Exercise 8.3 Show that z = i(ab* − a*b) is real. 
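
The conjugate formulas for Re(z), Im(z) and |z| are easy to try numerically. The sketch below assumes the hypothetical helper names `re_via_conjugate`, `im_via_conjugate` and `modulus_via_conjugate`, and uses the sample value 3 + 4i purely as an illustration.

```python
import math

def re_via_conjugate(z: complex) -> complex:
    """Re(z) = (z + z*)/2."""
    return (z + z.conjugate()) / 2

def im_via_conjugate(z: complex) -> complex:
    """Im(z) = (z - z*)/(2i)."""
    return (z - z.conjugate()) / 2j

def modulus_via_conjugate(z: complex) -> float:
    """|z| = sqrt(z z*), as in Equation 8.5; z z* has zero imaginary part."""
    return math.sqrt((z * z.conjugate()).real)

z = 3 + 4j
print(re_via_conjugate(z), im_via_conjugate(z), modulus_via_conjugate(z))
```

Both `re_via_conjugate` and `im_via_conjugate` return complex objects whose imaginary parts vanish, which mirrors the test "z* = z" for a real expression.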


8.1.3 The geometry of complex numbers 


Complex numbers were invented as early as 1545, when Gerolamo Cardano used 
them to solve cubic equations, but it took much longer for them to be interpreted 
in geometric terms. Around 1800, Caspar Wessel and Jean Argand realized that 
complex numbers can be represented as points in a plane (Figure 8.1). 


In geometric terms, the complex number x + iy is represented by a point with 
Cartesian coordinates (x, y), and the plane consisting of all such points is called 
the complex plane or the Argand diagram. In this plane, the x-axis, which 
contains only real numbers, is called the real axis, and the y-axis, which contains 
only imaginary numbers, is called the imaginary axis. 


The geometric relationships linking a complex number z = x + iy, its complex 
conjugate z*, and the complex numbers −z and −z* are shown in Figure 8.2. The 
operations of adding and subtracting complex numbers can also be visualized 
geometrically. Two complex numbers z and w are represented by the points 
shown in Figure 8.3. Their sum is represented by the point marked z + w at the 
corner of the blue parallelogram, while their difference is equal to the sum of z 
and −w and so is represented by the point marked z − w at the corner of the red 
parallelogram. 


Figure 8.2 The points z, z*, −z and −z* in the complex plane. 

Figure 8.3 Adding and subtracting two complex numbers z and w. 


So far, we have specified points in the complex plane by their Cartesian 
coordinates (x, y), and the corresponding complex numbers written as x + iy are 
then said to be in Cartesian form. However, we can also specify points in 
the complex plane by the polar coordinates r and θ as shown in Figure 8.4. 
Anticlockwise rotations increase the value of θ, while clockwise rotations 
decrease it. Throughout this chapter, we shall assume that angles are expressed in 
radians, and that θ is a pure number (of radians). Then, values of θ that differ by 
2π are equivalent in that they refer to the same point in the complex plane. 


We have 

$$ x = r\cos\theta \quad\text{and}\quad y = r\sin\theta, \tag{8.6} $$

so any complex number can be written in the polar form: 

$$ z = r(\cos\theta + i\sin\theta). \tag{8.7} $$


The radial coordinate is given by $r = \sqrt{x^2 + y^2}$, which is the modulus of the 
complex number. The angular coordinate θ is called the phase, or sometimes the 
argument, of the complex number. It can be found by solving either tan θ = y/x 
or cos θ = x/r for θ, taking care to choose the angle to be in the appropriate 
quadrant of the complex plane. For example, the complex number z = 1 − i lies 
in the fourth quadrant, so we choose $r = \sqrt{1^2 + (-1)^2} = \sqrt{2}$ and θ = −π/4 in 
this case, giving 

$$ 1 - i = \sqrt{2}\,[\cos(-\pi/4) + i\sin(-\pi/4)]. $$

Figure 8.4 Polar coordinates for a point in the complex plane. The angle θ shown has a positive value. 
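
Choosing the correct quadrant is the step most easily got wrong by hand, so a short numerical sketch may help. It assumes a hypothetical helper `to_polar` and relies on `math.atan2`, which takes y and x separately and therefore returns an angle in the right quadrant (in the range −π to π).

```python
import cmath
import math

def to_polar(z: complex) -> tuple[float, float]:
    """Return (r, theta) for z = x + iy, with theta in radians."""
    x, y = z.real, z.imag
    r = math.hypot(x, y)        # r = sqrt(x^2 + y^2), the modulus
    theta = math.atan2(y, x)    # quadrant-aware solution of tan(theta) = y/x
    return r, theta

r, theta = to_polar(1 - 1j)     # the worked example z = 1 - i from the text
print(r, theta)                 # approximately sqrt(2) and -pi/4
print(cmath.polar(1 - 1j))      # the standard library gives the same pair
```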


Exercise 8.4 Express the following complex numbers in polar form: 
(a) 1 + i√3, (b) −1 + i√3. 


Exercise 8.5 Use Equation 8.7 to confirm that |z| = r. 


8.1.4 Euler's formula and exponential form 


There is yet another way of expressing complex numbers, which is very important 
in quantum physics. To derive this alternative representation, we shall first 
consider a complex number of unit modulus (that is, one with |z| = r = 1). Such 
a complex number takes the simple form 


$$ z = \cos\theta + i\sin\theta. \tag{8.8} $$

Since all the normal rules of algebra apply to complex numbers, it is reasonable to 
suppose that the normal rules of differentiation apply too, provided that we treat 
the symbol i as a constant. We therefore have 

$$ \frac{dz}{d\theta} = \frac{d}{d\theta}(\cos\theta + i\sin\theta) = (-\sin\theta + i\cos\theta) = i(\cos\theta + i\sin\theta) = iz. $$

The general solution of this differential equation is 

$$ z = Ce^{i\theta}, \tag{8.9} $$

where C is a constant. Comparing Equations 8.8 and 8.9, we therefore have 

$$ \cos\theta + i\sin\theta = Ce^{i\theta}. $$

Setting θ = 0 on both sides of this equation, we see that 

$$ 1 = Ce^{0} = C, $$

so we can identify C = 1 and write 

$$ e^{i\theta} = \cos\theta + i\sin\theta. \tag{8.10} $$


This result is known as Euler's formula. It is a major mathematical discovery 
since it links two apparently different types of function: the exponential 
function and the trigonometric functions. For a real variable, θ, the exponential 
function $e^{\theta}$ is monotonically increasing, and does not display the periodic 
oscillations of sin θ or cos θ. However, the exponential function with an imaginary 
argument iθ is directly related to sines and cosines via Euler's formula. 
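
Euler's formula can be spot-checked numerically. The sketch below assumes a hypothetical helper `euler_gap` that measures the difference between the two sides of Equation 8.10 for a given real angle; a numerical check of this kind is an illustration, not a proof.

```python
import cmath
import math

def euler_gap(theta: float) -> float:
    """Return |e^{i theta} - (cos theta + i sin theta)| for a real angle theta."""
    left = cmath.exp(1j * theta)
    right = complex(math.cos(theta), math.sin(theta))
    return abs(left - right)

for theta in (0.0, 0.5, math.pi / 3, math.pi, 7.0):
    print(theta, euler_gap(theta))   # each gap is at the level of rounding error
```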


Exercise 8.6 Show that the cosine and sine functions can be represented as 

$$ \cos\theta = \frac{e^{i\theta} + e^{-i\theta}}{2} \quad\text{and}\quad \sin\theta = \frac{e^{i\theta} - e^{-i\theta}}{2i}. $$


Combining Euler's formula with Equation 8.7, we see that any complex number 
can be written in the form 

$$ z = re^{i\theta}, \tag{8.11} $$

where r is the radial coordinate (the modulus) of the complex number, and θ is the 
angular coordinate (the phase). A complex number written in this way is said to 
be in exponential form. In quantum mechanics, a complex number of the form 
$e^{i\theta}$, where θ is real, is sometimes called a phase factor. Such complex numbers 
have unit modulus. 


Exercise 8.7 Express the complex number z = e? e~?”/6 in Cartesian 
form. 

8.1.5 Multiplying and dividing complex numbers 


Although the Cartesian form, z = x + iy is useful for adding and subtracting 
complex numbers, other operations, such as multiplication, division and taking 
powers, are most easily carried out using the exponential form, as we shall now 
demonstrate. 


The exponential function has the property that 

$$ e^{a} \times e^{b} = e^{a+b}. $$

This rule applies even if a and b are complex, so we have 

$$ r_1 e^{i\theta_1} \times r_2 e^{i\theta_2} = r_1 r_2\, e^{i(\theta_1 + \theta_2)}. \tag{8.12} $$

To form the product of two complex numbers, we multiply their moduli and 
add their phases. (Moduli is the plural of modulus.) 


As before, the complex conjugate of a complex number is found by changing the 
sign of i wherever it appears, so we have 

$$ z^* = (re^{i\theta})^* = re^{-i\theta}. \tag{8.13} $$


To form the complex conjugate of a complex number we keep the modulus 
unchanged and change the sign of the phase. 


The product of a complex number z and its own complex conjugate z* is therefore 
given by 

$$ zz^* = re^{i\theta} \times re^{-i\theta} = r^2 e^{i(\theta - \theta)} = r^2 e^{0} = r^2, $$

which is just what we would expect from Equation 8.5. 

The reciprocal of a complex number $z = re^{i\theta}$ is given by 

$$ \frac{1}{z} = \frac{1}{re^{i\theta}} = \frac{1}{r}\,e^{-i\theta}, \tag{8.14} $$

and we can check that this does the right job by verifying that 

$$ z \times \frac{1}{z} = re^{i\theta} \times \frac{1}{r}\,e^{-i\theta} = e^{i(\theta - \theta)} = e^{0} = 1. $$

Note that the complex conjugate $z^* = re^{-i\theta}$ and the reciprocal $1/z = e^{-i\theta}/r$ are 
not the same thing: they are complex numbers with the same phase but different 
moduli. 


Dividing one complex number by another is also easy in exponential form. Using 
Equations 8.12 and 8.14, we see that 

$$ \frac{r_1 e^{i\theta_1}}{r_2 e^{i\theta_2}} = \frac{r_1}{r_2}\, e^{i(\theta_1 - \theta_2)}. \tag{8.15} $$

To form the quotient of two complex numbers, we divide their moduli and 
subtract their phases. 

Figure 8.5 (a) The effect of multiplying a complex number w by $re^{i\theta}$. (b) The effect of 
dividing a complex number w by $re^{i\theta}$. In the cases shown we have taken r = 2 and θ = π/4. 

Figure 8.5a shows the geometric effect of multiplying one complex number by 
another: multiplying by $z = re^{i\theta}$ causes stretching by a factor of r and rotation 
through an angle of θ radians. Similarly, Figure 8.5b shows the geometric effect 
of dividing one complex number by another: dividing by $re^{i\theta}$ causes stretching by 
a factor of 1/r and rotation through an angle of −θ radians. 
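
The product and quotient rules can be compared against ordinary complex arithmetic. In this sketch the helper names `multiply_polar` and `divide_polar` and the sample moduli and phases are assumptions made for illustration; `cmath.rect(r, theta)` builds the number with modulus r and phase theta.

```python
import cmath
import math

def multiply_polar(r1, theta1, r2, theta2):
    """Product rule (Equation 8.12): multiply the moduli, add the phases."""
    return r1 * r2, theta1 + theta2

def divide_polar(r1, theta1, r2, theta2):
    """Quotient rule (Equation 8.15): divide the moduli, subtract the phases."""
    return r1 / r2, theta1 - theta2

z1 = cmath.rect(2.0, math.pi / 4)    # modulus 2, phase pi/4
z2 = cmath.rect(3.0, math.pi / 6)    # modulus 3, phase pi/6

r, theta = multiply_polar(2.0, math.pi / 4, 3.0, math.pi / 6)
print(abs(z1 * z2 - cmath.rect(r, theta)))   # difference at rounding-error level

r, theta = divide_polar(2.0, math.pi / 4, 3.0, math.pi / 6)
print(abs(z1 / z2 - cmath.rect(r, theta)))   # difference at rounding-error level
```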




Exercise 8.8 Describe the geometric effects of (a) multiplying and (b) 
dividing a complex number by i. 


8.1.6 Powers of complex numbers 


The exponential function has the property that 

$$ (e^{a})^{n} = e^{na}, $$

where n is any number. This rule applies even if a is complex, so it is easy to take 
the square of a complex number in exponential form: 

$$ z^2 = (re^{i\theta})^2 = r^2 e^{2i\theta}, $$

and more generally, 

$$ z^n = r^n e^{in\theta} \tag{8.16} $$

for any integer n. 


Exercise 8.9 Given that z = (1 + i)/√2, what is $z^{100}$? 


A slight complication arises if n is fractional. For example, suppose we want to 
find the square root of $z = re^{i\theta}$. Direct use of Equation 8.16 gives 

$$ z^{1/2} = r^{1/2} e^{i\theta/2}, \tag{8.17} $$

where we take $r^{1/2}$ to be real and positive. This answer is certainly appropriate 
for a square root because we have 

$$ (r^{1/2} e^{i\theta/2})^2 = re^{i\theta} = z. $$

However, this is not the only square root of z. To see why, remember that any 
integer multiple of 2π can be added to the phase of $re^{i\theta}$ without affecting the 
complex number being represented. We therefore have 

$$ z = re^{i\theta} = re^{i(\theta + 2\pi)}. $$

Hence, taking the square root also gives 

$$ z^{1/2} = r^{1/2} e^{i(\theta + 2\pi)/2} = r^{1/2} e^{i\theta/2} e^{i\pi}. $$

Since $e^{i\pi} = \cos\pi + i\sin\pi = -1$, we conclude that $z^{1/2}$ has the alternative value 

$$ z^{1/2} = -r^{1/2} e^{i\theta/2}, \tag{8.18} $$

and this also makes good sense, since 

$$ (z^{1/2})^2 = (-r^{1/2} e^{i\theta/2})^2 = re^{i\theta} = z. $$


You might well have guessed that the square root could appear with a plus sign or 
minus sign, but the procedure we have just introduced can be generalized to find 
the nth root of any complex number. For example, the complex number $z = re^{i\theta}$ 
has three cube roots: 

$$ r^{1/3} e^{i\theta/3}, \quad r^{1/3} e^{i(\theta + 2\pi)/3} \quad\text{and}\quad r^{1/3} e^{i(\theta + 4\pi)/3}. $$


Figure 8.6 shows the three cube roots of −1 as points in the complex plane. 
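
The procedure for nth roots translates directly into a few lines of code. The following is a sketch under the assumption of a hypothetical helper `nth_roots`; it adds multiples of 2π to the phase before dividing by n, exactly as described above, and is applied here to the cube roots of −1.

```python
import cmath
import math

def nth_roots(z: complex, n: int) -> list[complex]:
    """Return the n distinct nth roots of a non-zero complex number z."""
    r, theta = cmath.polar(z)                 # z = r e^{i theta}
    root_modulus = r ** (1.0 / n)             # real, positive nth root of r
    return [cmath.rect(root_modulus, (theta + 2 * math.pi * m) / n)
            for m in range(n)]

for w in nth_roots(complex(-1, 0), 3):
    print(w, w ** 3)    # each w cubed is -1, up to rounding error
```

The three values lie close to (1/2, √3/2), (−1, 0) and (1/2, −√3/2), the points described in the caption of Figure 8.6.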


8.2 Ordinary differential equations 


A differential equation is an equation that involves derivatives of an unknown 
function that we would like to find. For example, the aim of solving the 
differential equation 

$$ \frac{dy}{dx} = x^2\, y(x), $$
is to find an expression for the dependent variable y in terms of the independent 
variable x. If the equation involves functions of more than one variable and their 
partial derivatives, it is a partial differential equation. If it involves only ordinary 
derivatives with respect to a single independent variable, it is an ordinary 
differential equation — although the adjective ‘ordinary’ is often omitted for 
brevity. Ordinary differential equations are the subject of this section. 


8.2.1 Types of differential equation 


The only type of differential equation we need to consider takes the form 


$$ a(x)\frac{d^2y}{dx^2} + b(x)\frac{dy}{dx} + c(x)\,y(x) = f(x). \tag{8.19} $$


This is a general second-order linear differential equation. 


• The order of a differential equation is the order of the highest derivative that 
appears in it. Assuming that a(x) is not identically equal to zero, the derivative 
of highest order in Equation 8.19 is $d^2y/dx^2$, so this equation is of second 
order. 


• A linear differential equation is one that can be written in the form 

$$ Ly(x) = f(x), \tag{8.20} $$

where L is a linear differential operator, that is, one that can be expressed 
as 

$$ L = a_n(x)\frac{d^n}{dx^n} + a_{n-1}(x)\frac{d^{n-1}}{dx^{n-1}} + \cdots + a_1(x)\frac{d}{dx} + a_0(x), \tag{8.21} $$

where $a_n(x), a_{n-1}(x), \ldots, a_0(x)$ are functions of x (some of which may be 
constant or equal to zero). The crucial feature of a linear differential operator is 
that it gives 

$$ L[\alpha y_1(x) + \beta y_2(x)] = \alpha\,Ly_1(x) + \beta\,Ly_2(x) \tag{8.22} $$

for any smooth functions y₁(x) and y₂(x) and any constants α and β. 


Figure 8.6 The three cube roots of −1 are shown as dots at (−1, 0), at (1/2, √3/2) and at 
(1/2, −√3/2). 

This section supports the study of Chapters 2, 3 and 7. 

An example of Equation 8.24 can be found in Chapter 1, where λ is the decay constant of 
a radioactive nucleus. 


We are usually interested in the special case of Equation 8.19 that arises when 
f(x) = 0 for all x, so that 


$$ a(x)\frac{d^2y}{dx^2} + b(x)\frac{dy}{dx} + c(x)\,y(x) = 0. \tag{8.23} $$


An equation of this type is said to be homogeneous. It is not sufficient to say that 
a homogeneous equation is one with zero on the right-hand side, as this could 
always be achieved by a rearrangement of terms. The distinguishing feature of a 
homogeneous equation is that each of its terms is proportional to y or one of 

its derivatives. The simplest case occurs when the coefficients, a, b and c are 
constants. We then have a second-order linear homogeneous equation with 
constant coefficients. The description is certainly a mouthful, but such equations 
are relatively easy to solve, and you will see examples of this shortly. It is usually 
much harder to solve equations with non-constant coefficients, but special tricks 
can sometimes be used, and you will see an example of this in Chapter 5. 


8.2.2 General properties of solutions 


In general it is found that a given differential equation is satisfied by a family of 
functions that can be described by a single comprehensive formula involving 
arbitrary constants. A solution to a differential equation of order n contains n 
arbitrary constants. The arbitrary constants do not appear in the differential 
equation itself but serve to distinguish one solution from another. For example, 
the differential equation 


$$ \frac{dy}{dt} = -\lambda\, y(t) \tag{8.24} $$

is satisfied by a family of functions of the form $y(t) = Ae^{-\lambda t}$, where A is an 
arbitrary constant; λ is not an arbitrary constant but is a parameter of the equation. 


The solution that includes the arbitrary constants is called the general solution 
of the differential equation, while a solution that corresponds to a particular 
choice of arbitrary constants is called a particular solution. In a given physical 
situation, we may have enough additional information to determine the values of 
the arbitrary constants. For example, in the situation described by Equation 8.24, 
if we know that y = 10 when t = 0, we can deduce that A = 10, and therefore 
select the particular solution $y(t) = 10\,e^{-\lambda t}$. 


Linear homogeneous differential equations have an important property that 
follows from Equation 8.22. If y₁(x) and y₂(x) are two solutions of a linear 
homogeneous differential equation, we have 

$$ Ly_1(x) = 0 \quad\text{and}\quad Ly_2(x) = 0, $$

so it follows that 

$$ L[\alpha y_1(x) + \beta y_2(x)] = \alpha\,Ly_1(x) + \beta\,Ly_2(x) = 0, $$

for any constants α and β. Thus, any linear combination of solutions is also a 
solution of the differential equation. This fact is called the principle of 
superposition. 


8.2.3 Solutions of some basic differential equations 


When considering the quantum-mechanical behaviour of a particle in a region of 
constant potential energy, you will meet second-order differential equations of the 
form 

$$ \frac{d^2y}{dx^2} \pm k^2 y(x) = 0, \tag{8.25} $$

where k is a real constant. We will consider first the case where the plus sign 
appears and Equation 8.25 takes the form 

$$ \frac{d^2y}{dx^2} + k^2 y(x) = 0. \tag{8.26} $$

To solve an equation of this type, we can substitute in a trial function of the form 
$y(x) = e^{\alpha x}$, where α is a constant that remains to be determined. Evaluating the 
second derivative of this trial function, we see that 

$$ \frac{d^2y}{dx^2} = \frac{d^2}{dx^2}\,e^{\alpha x} = \alpha^2 e^{\alpha x}. $$

Substituting into Equation 8.26 then leads to the equation 

$$ (\alpha^2 + k^2)\,e^{\alpha x} = 0, $$

which implies that 

$$ \alpha^2 + k^2 = 0. \tag{8.27} $$

This algebraic equation is called the auxiliary equation corresponding to the 
original differential equation. Solving the auxiliary equation for α gives α = ±ik, 
so we conclude that solutions of the form $y(x) = e^{ikx}$ or $y(x) = e^{-ikx}$ will satisfy 
Equation 8.26. This differential equation is linear and homogeneous, so the 
principle of superposition tells us that it is also satisfied by any function of the 
form 

$$ y(x) = Ae^{ikx} + Be^{-ikx}, \tag{8.28} $$


where A and B are arbitrary constants (which may be real or complex). 
Equation 8.28 is the general solution of Equation 8.26; it contains two arbitrary 
constants, as expected for the general solution of a second-order differential 
equation. 
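
A general solution of this kind can be tested numerically by estimating the second derivative with a finite difference. The sketch below is only an illustration: the helper names `y` and `residual`, the step size h and the sample values of A, B and k are all assumptions, and a small residual is evidence rather than proof that the function satisfies the equation y″ + k²y = 0.

```python
import cmath

def y(x: float, A: complex, B: complex, k: float) -> complex:
    """General solution y(x) = A e^{ikx} + B e^{-ikx}."""
    return A * cmath.exp(1j * k * x) + B * cmath.exp(-1j * k * x)

def residual(x: float, A: complex, B: complex, k: float, h: float = 1e-4) -> complex:
    """Estimate y''(x) + k^2 y(x) using a central finite difference for y''."""
    second = (y(x + h, A, B, k) - 2 * y(x, A, B, k) + y(x - h, A, B, k)) / h**2
    return second + k**2 * y(x, A, B, k)

A, B, k = 1 + 2j, 0.5 - 1j, 2.0      # arbitrary complex constants and a real k
for x in (0.0, 0.3, 1.7):
    print(x, abs(residual(x, A, B, k)))   # small compared with |k^2 y|
```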


Using Euler's formula (Equation 8.10) with θ = kx allows us to write 
Equation 8.28 in the alternative form 

$$ y(x) = C\cos(kx) + D\sin(kx), \tag{8.29} $$

where C = A + B and D = i(A − B) are also arbitrary constants. Equations 8.28 
and 8.29 are alternative descriptions of the general solution of Equation 8.26. You 
will see in the physics chapters that each of these descriptions has advantages 
under different circumstances. 

Equations 8.33 and 8.34 are the hyperbolic analogues of the trigonometric relations derived 
in Exercise 8.6. 

The prime in y′(x) indicates that the function y(x) is differentiated with respect to its 
argument x. 


A very similar analysis can be carried out for the differential equation 

$$ \frac{d^2y}{dx^2} - k^2 y(x) = 0. \tag{8.30} $$

This time, substituting in a trial solution of the form $y(x) = e^{\alpha x}$ leads to the 
auxiliary equation 

$$ \alpha^2 - k^2 = 0, \tag{8.31} $$

which has solutions α = ±k. Consequently, Equation 8.30 is satisfied by 
$y(x) = e^{kx}$ and $y(x) = e^{-kx}$, and its general solution can be written as 

$$ y(x) = Ae^{kx} + Be^{-kx}, \tag{8.32} $$


where A and B are arbitrary constants. 


The exponential functions $e^{kx}$ and $e^{-kx}$ are related to the so-called hyperbolic 
functions: 

$$ \cosh(kx) = \frac{e^{kx} + e^{-kx}}{2}, \tag{8.33} $$

$$ \sinh(kx) = \frac{e^{kx} - e^{-kx}}{2}, \tag{8.34} $$

and these allow us to express the general solution of Equation 8.30 in the 
alternative form 

$$ y(x) = C\cosh(kx) + D\sinh(kx), \tag{8.35} $$

where C = A + B and D = (A − B) are further arbitrary constants. 


8.2.4 Boundary conditions and eigenvalue equations 


It is sometimes possible to refine the general solution of a differential equation to 
obtain the particular solution that applies under a given set of circumstances. To 
achieve this we need additional information so that we can determine the arbitrary 
constants. For a second-order differential equation it is sufficient to know the 
value of the dependent variable, y, and its derivative, dy/dx at some value of the 
independent variable, x. 


For example, the general solution of $d^2y/dx^2 + k^2 y(x) = 0$ is 

$$ y(x) = C\cos(kx) + D\sin(kx), \tag{Eqn 8.29} $$

so it follows that 

$$ y'(x) = \frac{dy}{dx} = -kC\sin(kx) + kD\cos(kx). $$

Now, if we happen to know that y(0) = 1 and y′(0) = 2, we can substitute these 
values into the above equations to obtain C = 1 and D = 2/k. The particular 
solution consistent with these conditions is therefore 

$$ y(x) = \cos(kx) + \frac{2}{k}\sin(kx). $$
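
The step from initial conditions to a particular solution can be written as a small function. This is a sketch, assuming a hypothetical helper `particular_solution` and an illustrative value k = 3; it uses C = y(0) and D = y′(0)/k, which follow from setting x = 0 in the two equations above.

```python
import math

def particular_solution(k: float, y0: float, dy0: float):
    """Return y(x) = C cos(kx) + D sin(kx) with y(0) = y0 and y'(0) = dy0."""
    C = y0          # from y(0) = C
    D = dy0 / k     # from y'(0) = k D
    return lambda x: C * math.cos(k * x) + D * math.sin(k * x)

y = particular_solution(k=3.0, y0=1.0, dy0=2.0)
h = 1e-6
print(y(0.0))                          # 1.0, the required value of y(0)
print((y(h) - y(-h)) / (2 * h))        # close to 2, the required slope y'(0)
```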




Conditions that supply extra information about the solution of a differential 
equation at a single value of the independent variable are called initial conditions. 
More generally, any conditions that give extra information about the solutions are 
called boundary conditions. For example, the information that y(x) is a smooth 
function of x that does not diverge as x → ±∞ is typical of the boundary 
conditions you will meet in quantum mechanics. 


In quantum mechanics, boundary conditions also occur in a context that does not 
lead to a unique particular solution. You will often meet linear homogeneous 
differential equations of the form 


$$ Ly(x) = \lambda\, y(x), \tag{8.36} $$

where L is a linear operator and the constant λ has an unknown value. In this 
context, the boundary conditions give us information about the possible values of 
λ (which are known as eigenvalues) and the possible solutions y(x) (which are 
known as eigenfunctions). Equations like this are called eigenvalue equations. 
An example of an eigenvalue equation is 

$$ \frac{d^2y}{dx^2} = \lambda\, y(x). \tag{8.37} $$

This is identical to Equation 8.25, except that the constant λ is now regarded as an 
undetermined parameter that is to be constrained by the boundary conditions. To 
take a specific case, let us suppose that the boundary conditions are: 

$$ y(0) = 0 \quad\text{and}\quad y(L) = 0. $$


There are three cases to consider: λ = 0, λ = k² and λ = −k² (where without 
any loss in generality we can take k to be positive). First, if λ = 0, the differential 
equation reduces to $d^2y/dx^2 = 0$, which has the general solution 

$$ y(x) = A + Bx, $$


where A and B are arbitrary constants. In this case, the boundary conditions give 
A = B = 0, and we are left with the solution y(x) = 0 for all x. Trivial solutions 
such as this are usually of no physical interest and are not counted as acceptable 
solutions of the eigenvalue equation. 


The next case, λ = k², produces the general solution 

$$ y(x) = Ae^{kx} + Be^{-kx}. \tag{Eqn 8.32} $$

In this case, the boundary conditions give 

$$ A + B = 0 \quad\text{and}\quad Ae^{kL} + Be^{-kL} = 0, $$

but there is no way of satisfying these equations with non-zero A and B, since they 
imply that $e^{2kL} = 1$, which cannot be satisfied by any non-zero k. 


Finally, we consider the case λ = −k², which produces the general solution 

$$ y(x) = C\cos(kx) + D\sin(kx). \tag{Eqn 8.29} $$

Now, the boundary conditions give 

$$ C = 0 \quad\text{and}\quad D\sin kL = 0. $$

Rejecting the trivial solution in which C = D = 0, and noting that the case 
k = 0 has already been dealt with, we conclude that kL = nπ, where n is a 
non-zero integer. Since k has been taken to be positive, we can restrict n to the 
positive integers 1, 2, 3, .... It therefore follows that the differential equation has 
acceptable solutions (consistent with the boundary conditions) if and only if 

$$ \lambda = -k^2 = -\frac{n^2\pi^2}{L^2} \quad\text{for } n = 1, 2, 3, \ldots $$

These special values of λ are the eigenvalues of the problem and the 
corresponding eigenfunctions are $y_n(x) = D_n \sin(n\pi x/L)$, where we have used 
the integer n to label different eigenfunctions and replaced D by $D_n$ to allow for 
the possibility that different arbitrary constants may be chosen for different values 
of n. 
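
The eigenvalues and eigenfunctions found above can be tabulated and checked numerically. The sketch below assumes the hypothetical helper names `eigenvalue` and `eigenfunction`, an illustrative interval length L = 2 and the arbitrary choice D = 1 for every n; the finite-difference check of y″ = λy is an illustration only.

```python
import math

def eigenvalue(n: int, L: float) -> float:
    """Allowed value lambda_n = -(n pi / L)^2 for y(0) = y(L) = 0."""
    return -(n * math.pi / L) ** 2

def eigenfunction(n: int, L: float, D: float = 1.0):
    """Return y_n(x) = D sin(n pi x / L); D is an arbitrary constant."""
    return lambda x: D * math.sin(n * math.pi * x / L)

L = 2.0
for n in (1, 2, 3):
    y_n = eigenfunction(n, L)
    x, h = 0.37, 1e-4
    second = (y_n(x + h) - 2 * y_n(x) + y_n(x - h)) / h**2   # estimate of y''
    print(n, eigenvalue(n, L), second - eigenvalue(n, L) * y_n(x))  # last entry near 0
```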


In an eigenvalue problem such as this, the boundary conditions do not lead to 
a unique particular solution, free from all arbitrary constants. Instead, they 
restrict the possible values of the parameter λ that appears in the differential 
equation, requiring it to be one of the allowed eigenvalues. Different eigenvalues 
correspond to different eigenfunctions, and each eigenfunction $y_n(x)$ still contains 
an arbitrary constant, $D_n$, whose value remains to be decided. 


It is also worth noting that the principle of superposition does not imply that 

a linear combination of eigenfunctions with different eigenvalues is also an 
eigenfunction. If two eigenfunctions have different eigenvalues, they satisfy 
different differential equations, so the principle of superposition does not apply in 
this case. 


It is remarkable how the process of solving a differential equation yields a 
simple set of numbers: the eigenvalues. Schrödinger was well aware of this 
phenomenon from his knowledge of waves in classical physics, where the 
frequencies of vibrating strings and organ pipes can be related to the eigenvalues 
of differential equations. He therefore set himself the task of finding an 
appropriate eigenvalue equation for atomic physics, where the eigenvalues 
would give the allowed energies of atoms; the result was the time-independent 
Schrödinger equation, which is a major topic in this book. 


Exercise 8.10 Suppose that the eigenvalue equation given in Equation 8.37 is 
supplemented by the boundary conditions y(0) = y′(L) = 0. What are the 
eigenvalues and eigenfunctions in this case? 


The above exercise illustrates an important general point: eigenvalues and 
eigenfunctions depend on the boundary conditions. We need to know the 
eigenvalue equation and its associated boundary conditions to specify an 
eigenvalue problem. 


8.3 Partial differential equations 


8.3.1 Partial differentiation 


If f(x, t) is a function of the two independent variables x and t, we can 
differentiate f(x, t) with respect to x while treating t as a constant. The result of 
this differentiation is called the partial derivative of f with respect to x, and is 
written as ∂f/∂x. Similarly, we can differentiate f(x, t) with respect to t while 
treating x as a constant; this gives the partial derivative of f with respect to t, 
written as ∂f/∂t. 


For example, consider the function 

$$ f(x, t) = \sin(kx - \omega t), \tag{8.38} $$

where k and ω are constants. In this case, partially differentiating with respect to 
x gives 

$$ \frac{\partial f}{\partial x} = k\cos(kx - \omega t), \tag{8.39} $$

while partially differentiating with respect to t gives 

$$ \frac{\partial f}{\partial t} = -\omega\cos(kx - \omega t). \tag{8.40} $$


This is no harder than ordinary differentiation provided that we remember to treat 
t as a constant in the first differentiation and x as a constant in the second. In 
graphical terms, these partial derivatives represent the gradients of f(x, t) in the 
x- and t-directions respectively, as illustrated in Figure 8.7. Partial derivatives can 
be evaluated for functions of any number of variables (not just two); each partial 
derivative is obtained by differentiating the function with respect to one of its 
variables while keeping all the other variables constant. 


Figure 8.7 Geometrical interpretation of the partial derivatives ∂f/∂x and ∂f/∂t. 
Parts (a) and (b) are three-dimensional plots of the function f(x, t) while (c) is the 
corresponding contour map. The partial derivative ∂f/∂x is the rate of change of f in the 
direction of the arrow in (a) while ∂f/∂t is the rate of change of f in the direction of 
the arrow in (b). 
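
The idea of holding one variable fixed while varying the other carries over directly to a numerical estimate. The sketch below assumes hypothetical helpers `partial_x` and `partial_t` based on central differences, with illustrative values k = 2 and ω = 3 for the function in Equation 8.38.

```python
import math

K, OMEGA = 2.0, 3.0    # illustrative values of the constants k and omega

def f(x: float, t: float) -> float:
    """The function of Equation 8.38: f(x, t) = sin(kx - omega t)."""
    return math.sin(K * x - OMEGA * t)

def partial_x(func, x: float, t: float, h: float = 1e-6) -> float:
    """Estimate the partial derivative with respect to x, holding t fixed."""
    return (func(x + h, t) - func(x - h, t)) / (2 * h)

def partial_t(func, x: float, t: float, h: float = 1e-6) -> float:
    """Estimate the partial derivative with respect to t, holding x fixed."""
    return (func(x, t + h) - func(x, t - h)) / (2 * h)

x, t = 0.4, 1.1
print(partial_x(f, x, t), K * math.cos(K * x - OMEGA * t))        # compare with Equation 8.39
print(partial_t(f, x, t), -OMEGA * math.cos(K * x - OMEGA * t))   # compare with Equation 8.40
```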


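The partial derivatives ∂f/∂x and ∂f/∂t are called first-order partial 
derivatives because they involve a single partial differentiation. These partial 
derivatives are themselves functions of x and t, so they can be partially 
differentiated again. For a function of two independent variables, f(x, t), there 
are four second-order partial derivatives: 

$$ \frac{\partial^2 f}{\partial x^2} = \frac{\partial}{\partial x}\left(\frac{\partial f}{\partial x}\right), \qquad \frac{\partial^2 f}{\partial t^2} = \frac{\partial}{\partial t}\left(\frac{\partial f}{\partial t}\right), $$

$$ \frac{\partial^2 f}{\partial t\,\partial x} = \frac{\partial}{\partial t}\left(\frac{\partial f}{\partial x}\right), \qquad \frac{\partial^2 f}{\partial x\,\partial t} = \frac{\partial}{\partial x}\left(\frac{\partial f}{\partial t}\right). \tag{8.41} $$

However, for well-behaved functions (and this includes any functions you will 
need to differentiate in this course) it turns out that the sequence of partial 
differentiation is irrelevant: 

$$ \frac{\partial^2 f}{\partial t\,\partial x} = \frac{\partial^2 f}{\partial x\,\partial t}, \tag{8.42} $$

so, in practice, only three of the second-order partial derivatives are independent. 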


Exercise 8.11 Find the four second-order partial derivatives of the function in 
Equation 8.38 and verify that Equation 8.42 is valid in this case. 


8.3.2 Partial differential equations 


A partial differential equation is an equation that contains partial derivatives of 
an unknown function that we would like to find. For example, the equation 

$$ \frac{\partial^2 f}{\partial x^2} = \frac{1}{c^2}\frac{\partial^2 f}{\partial t^2}, \tag{8.43} $$

where c is a constant, is a partial differential equation for the function f(x, t). 
This equation governs the propagation of some one-dimensional waves. 


Partial differential equations can be classified in much the same way as ordinary 
differential equations. For example, Equation 8.43 is a second-order linear 
homogeneous partial differential equation. It can be expressed in the form 


$$ Lf(x, t) = 0, $$

where 

$$ L = \frac{\partial^2}{\partial x^2} - \frac{1}{c^2}\frac{\partial^2}{\partial t^2} $$

is a linear (partial) differential operator. 


Linear homogeneous partial differential equations satisfy the principle of 
superposition (the switch from ordinary differentiation to partial differentiation in 
no way affects our previous argument). So, if f₁(x, t) and f₂(x, t) are both 
solutions of a linear homogeneous partial differential equation, then so is the 
linear combination αf₁(x, t) + βf₂(x, t), where α and β are any constants. This 
principle is a central feature in quantum physics, and it underlies some important 
physical effects, including interference phenomena and the behaviour of quantum 
computers. 


The general solution of a partial differential equation can be very general indeed, 
encompassing wide classes of function that cannot be characterized by a few 
arbitrary constants. For example, Equation 8.43 is satisfied by any well-behaved 
function of the form f(x, t) = g(x + ct). The only restriction on the function g 
is that we should be able to differentiate it twice. Using the principle of 
superposition, we see that any function of the form 


\[
f(x,t) = \alpha\, g(x - ct) + \beta\, g(x + ct), \tag{8.44}
\]

where $\alpha$ and $\beta$ are constants, satisfies Equation 8.43 (and is, in fact, the general
solution of this partial differential equation).
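The claim that travelling-wave combinations satisfy Equation 8.43 can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes a Gaussian profile for g, illustrative values for c, α and β, and a simple central-difference estimate of the second derivatives. A function travelling at the wrong speed is included for contrast.

```python
import math

def second_difference(func, u, h=1e-3):
    """Central-difference estimate of the second derivative of func at u."""
    return (func(u + h) - 2.0 * func(u) + func(u - h)) / h**2

def wave_residual(f, x, t, c):
    """Return d2f/dx2 - (1/c^2) d2f/dt2; this vanishes for solutions of Eq. 8.43."""
    f_xx = second_difference(lambda u: f(u, t), x)
    f_tt = second_difference(lambda u: f(x, u), t)
    return f_xx - f_tt / c**2

# Illustrative (assumed) values; none of these come from the text.
C_SPEED = 2.0
ALPHA, BETA = 0.7, -1.3

def g(u):
    """An arbitrary twice-differentiable profile (assumed: a Gaussian)."""
    return math.exp(-u * u)

def travelling(x, t):
    """Superposition of the form used in Equation 8.44."""
    return ALPHA * g(x - C_SPEED * t) + BETA * g(x + C_SPEED * t)

def wrong_speed(x, t):
    """A profile moving at speed c/2, which should NOT satisfy Eq. 8.43."""
    return g(x - 0.5 * C_SPEED * t)

residual_ok = wave_residual(travelling, 0.3, 0.2, C_SPEED)
residual_bad = wave_residual(wrong_speed, 0.3, 0.2, C_SPEED)
print(residual_ok, residual_bad)
```

The first residual should be zero to within the finite-difference error, while the second is clearly non-zero.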


The boundary conditions needed to specify a particular solution of a partial 
differential equation are much more extensive than those needed for an ordinary 
differential equation. In the case of Equation 8.43, sufficient information would 
be provided by the initial (t = 0) values of the function f(x, t) and its partial 
derivative $\partial f/\partial t$, but we would need to know these values for all x.


Exercise 8.12 By differentiating twice with respect to x, and twice with
respect to t, show that any function $f(x,t) = g(x - ct)$ of the single variable
$x - ct$ satisfies Equation 8.43.


8.3.3 The method of separation of variables 


Finally, we shall describe a method that allows us to find some solutions of some 
partial differential equations (we can put it no more strongly than that). The 
method will be illustrated for Equation 8.43. 


The basic idea is to look for solutions that are products of functions of a single 
variable. In the present case, we shall look for solutions of the form 


\[
f(x,t) = X(x)\,T(t), \tag{8.45}
\]


and then try to obtain and solve ordinary differential equations for the functions 
X(x) and T(t). This tactic is called the method of separation of variables. 


Partially differentiating Equation 8.45 twice with respect to x gives $X''(x)\,T(t)$,
where the primes denote differentiation of a function with respect to its argument.
Similarly, partially differentiating twice with respect to t gives $X(x)\,T''(t)$. When
these results are substituted into Equation 8.43, we obtain

\[
X''(x)\,T(t) = \frac{1}{c^2}\,X(x)\,T''(t).
\]

Dividing both sides by $X(x)\,T(t)$ then gives

\[
\frac{X''(x)}{X(x)} = \frac{1}{c^2}\,\frac{T''(t)}{T(t)}. \tag{8.46}
\]


Now comes the inspired step. We notice that the left-hand side of Equation 8.46
depends only on x, while the right-hand side depends only on t. How can a
function of x be equal to a function of t for all values of x and t? This can only
happen if both sides of the equation are equal to the same constant, $\lambda$, say. We can
therefore write

\[
\frac{X''(x)}{X(x)} = \frac{1}{c^2}\,\frac{T''(t)}{T(t)} = \lambda,
\]

where the undetermined constant $\lambda$ is called the separation constant. Our partial
differential equation therefore splits into two ordinary differential equations:

\[
\frac{\mathrm{d}^2 X}{\mathrm{d}x^2} = \lambda X \qquad\text{and}\qquad \frac{\mathrm{d}^2 T}{\mathrm{d}t^2} = \lambda c^2\, T.
\]

These are eigenvalue equations of the type met in Section 8.2. The allowed values
of $\lambda$ may be restricted by the boundary conditions. To take a definite case, let us
suppose that $\lambda$ is negative. Then we can write $\lambda = -k^2$ and express the solutions
as


\[
X(x) = A\cos(kx) + B\sin(kx),
\]

\[
T(t) = C\cos(kct) + D\sin(kct).
\]

Equation 8.43 then has a solution of the form

\[
f(x,t) = \left[A\cos(kx) + B\sin(kx)\right]\left[C\cos(kct) + D\sin(kct)\right], \tag{8.48}
\]

where A, B, C and D are arbitrary constants.
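The separation argument can be illustrated numerically. The sketch below is an illustration only, with assumed values for k, c and the constants A, B, C and D. It estimates X''/X and T''/(c²T) by central differences at an arbitrary point and compares both with the separation constant −k².

```python
import math

def second_difference(func, u, h=1e-3):
    """Central-difference estimate of the second derivative of func at u."""
    return (func(u + h) - 2.0 * func(u) + func(u - h)) / h**2

# Illustrative (assumed) values; the text leaves all of these arbitrary.
K = 3.0                 # wave number k, so the separation constant is -K**2
C_SPEED = 2.0           # the constant c of Equation 8.43
A, B, C, D = 1.0, 0.5, 0.8, -0.3

def X(x):
    """Spatial factor of the product solution."""
    return A * math.cos(K * x) + B * math.sin(K * x)

def T(t):
    """Temporal factor of the product solution."""
    return C * math.cos(K * C_SPEED * t) + D * math.sin(K * C_SPEED * t)

x0, t0 = 0.4, 0.25      # any point where X and T are non-zero will do

lhs = second_difference(X, x0) / X(x0)                   # X''/X
rhs = second_difference(T, t0) / (C_SPEED**2 * T(t0))    # T''/(c^2 T)
print(lhs, rhs, -K**2)
```

Both ratios should agree with −k² to within the finite-difference error, whichever point (x0, t0) is chosen, which is the content of the "inspired step".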


The method of separation of variables has two significant limitations. First, it 
cannot be used for all partial differential equations. If the equation involves a term 
such as e~* /©*, for example, substitution of a product function X (x)T(t), 
followed by rearrangement, will never produce an equation like Equation 8.46 
where the terms in x appear on one side of the equation, and the terms in t appear 
on the other side. Equations for which such a separation of variables can be 
achieved are called separable. The second limitation is that the method does not 
give the most general solution of the partial differential equation. It focuses 
attention on solutions of the form X(x)T(t), but it would be idle to suppose 

that all solutions have this form — clearly they do not, as demonstrated by 
Equation 8.44. 


Nevertheless, the method of separation of variables is of major importance. One 
reason is that product solutions of the form X(x)T(t) are useful in many physical 
applications. In classical physics, they represent standing waves with fixed nodes; 
in quantum physics, they represent states of definite energy. Another reason is 
based on the principle of superposition. If the partial differential equation is 
linear and homogeneous, the principle of superposition allows us to use product 
solutions as building blocks from which more complicated solutions can be 
constructed. In many cases, the most general solution of a partial differential 
equation can be obtained by taking an arbitrary linear combination of the product 
solutions that emerge from the method of separation of variables. 


8.4 Probability 
8.4.1 The concept of probability 


Many everyday experiments or trials have uncertain results or outcomes; 
examples include the result of tossing a coin, the hand you are dealt in a card
game or tomorrow’s weather in London. The first aim of a theory of probability is
to assign numerical probabilities to outcomes in a way that quantifies their 
likelihood. 


The scale of probability runs from zero (for outcomes that are impossible)
to 1 (for outcomes that are certain). If the probability of an outcome is 0.5, it
is as likely to happen as not (Figure 8.8). More generally, an outcome has a
probability of p if it is expected to happen p × 100% of the time. For experiments
that can be repeated endlessly, the fraction of times a particular outcome occurs is
expected to approach its probability as the number of repetitions of the experiment
approaches infinity. In the long run, for example, we expect to get heads in about
50% of the tosses of an unbiased coin, corresponding to p = 0.5 for this outcome.

Figure 8.8 The scale of probability, 0 ≤ p ≤ 1 (0: impossible; 0.5: as likely as not; 1: certain).


It is remarkable that unpredictable events can be described by such a simple 
concept as probability. A commonly-held (but erroneous) view is that a long run 
of tails will increase the chances of getting heads on the next coin toss, because (it 
is said) ‘the laws of chance tend to even things up’. This is not how blind chance 
works: the probability of tossing heads is 0.5 irrespective of the past history of 
coin tosses. It is reasonable to suppose that heads will occur roughly 50% of the 
time in the very long run, but this happens because any initial run of tails will be 
diluted and eventually swamped by a much longer run of roughly equal numbers 
of heads and tails that follows. 
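The point about runs of tails can be explored with a simulated coin. This is a minimal sketch under stated assumptions: a pseudo-random generator stands in for an unbiased coin, and the run length of three tails, the number of tosses and the seed are arbitrary choices made for illustration.

```python
import random

def simulate(n_tosses, run_length, seed=0):
    """Toss a simulated fair coin n_tosses times.

    Returns (overall fraction of heads,
             fraction of heads on tosses that immediately follow
             at least run_length consecutive tails).
    """
    rng = random.Random(seed)
    heads = 0
    tails_run = 0            # current number of consecutive tails
    after_run_total = 0      # tosses made straight after a long run of tails
    after_run_heads = 0      # ... of which this many came up heads
    for _ in range(n_tosses):
        is_head = rng.random() < 0.5
        if tails_run >= run_length:
            after_run_total += 1
            after_run_heads += is_head
        heads += is_head
        tails_run = 0 if is_head else tails_run + 1
    return heads / n_tosses, after_run_heads / after_run_total

overall, after_tails = simulate(n_tosses=200_000, run_length=3)
print(overall, after_tails)
```

Both fractions should come out close to 0.5: a preceding run of tails does not make heads more likely on the next toss.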


Some experiments (such as the running of an Olympic race) can never be repeated
under exactly the same conditions. In such cases, a more suitable definition
of probability is based on the fair odds for a wager. Suppose that you place
0 < p < 1 pounds into a kitty, and I place (1 − p) pounds into it, on the
understanding that you will collect the kitty if one particular outcome occurs, and 
I will collect the kitty if it does not. Then, p is equal to the probability of the 
outcome if the bet is fair to both of us. This way of looking at probability implies 
that it has a meaning even for one-off, unrepeatable events. When we use the 
concept of probability in quantum physics, it suggests that we can talk about 
probability for a single atom or electron. 


If a given outcome has a definite probability, and no further information is 
available to us about whether it will occur or not, the outcome is said to be 
random. In everyday life, tossing heads on a coin may be taken to be random. If 
we took enough trouble to observe exactly how the coin was tossed we could, in 
principle, predict whether it would land with heads uppermost, but the necessary 
information is not usually collected, so we can treat the outcome as random. A 
very different situation arises in quantum physics. Quantum physics predicts the 
probabilities of the results of measurements, but these probabilities are regarded 
as being fundamental; we believe that the basic laws of Nature are expressed in 
terms of probabilities. In quantum physics, probabilities do not arise through lazy 
or inadequate observations, or because of a lack of computational power, but are 
believed to be intrinsic to the way the world works.


8.4.2 Adding probabilities 


Suppose that we know the probabilities of a set of outcomes. For example, 
we normally assign a probability of 1/6 to each of the six spot numbers on a 
die, trusting that the die is not loaded. We still have the problem of assigning 
probabilities to more complicated (compound) outcomes. What is the probability
of rolling a number less than 4, for example? 


We say that the outcomes of a trial are mutually exclusive if the occurrence of 
one of the outcomes automatically excludes the possibility of any of the other 
outcomes in the same trial. For example, the six different spot numbers on a die 
are mutually exclusive because, if you roll a 5 you cannot, at the same time, roll a 
3. The following rule applies to mutually exclusive outcomes: 


The addition rule for probability 


If $A_1, A_2, \ldots, A_n$ are a set of mutually exclusive outcomes with
probabilities $p_1, p_2, \ldots, p_n$, then the probability P of obtaining either $A_1$ or
$A_2$ or ... $A_n$ in a single trial is the sum of the individual probabilities:

\[
P = p_1 + p_2 + \cdots + p_n = \sum_{i=1}^{n} p_i. \tag{8.49}
\]


A set of mutually exclusive outcomes is said to be complete if it covers every 
possible outcome of the experiment. In this case, we are certain to get one of these 
outcomes, so P = 1 in Equation 8.49. We therefore have 


\[
\sum_{i=1}^{n} p_i = 1, \tag{8.50}
\]

where the sum is over all the mutually exclusive outcomes. This is called the
normalization rule for probabilities. This condition requires us to sum over
all the possible outcomes in a mutually exclusive set. Failing to sum over all
the outcomes would give a probability less than 1, and if the outcomes were
not mutually exclusive, there would be some double-counting. Fortunately,
experiments in quantum mechanics generally do produce mutually exclusive
outcomes: when the energy of an atom is measured, for example, the result of
the measurement is a definite energy value, and this automatically excludes other
values.
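The addition and normalization rules can be sketched in a few lines. The example below is illustrative only: the ratio 1 : 2 : 5 is an assumed example (chosen to differ from the exercise that follows), and exact fractions are used to avoid rounding.

```python
from fractions import Fraction

def probabilities_from_ratio(ratio):
    """Turn relative weights of a complete, mutually exclusive set of
    outcomes into probabilities that satisfy the normalization rule."""
    total = sum(ratio)
    return [Fraction(part, total) for part in ratio]

# Assumed example: three mutually exclusive outcomes in the ratio 1 : 2 : 5.
probs = probabilities_from_ratio([1, 2, 5])
print(probs, sum(probs))

# Addition rule for the die discussed in the text: rolling a number
# less than 4 means rolling a 1, a 2 or a 3, each with probability 1/6.
p_less_than_4 = sum(Fraction(1, 6) for score in (1, 2, 3))
print(p_less_than_4)
```

Dividing each weight by the sum of the weights is exactly what Equation 8.50 requires of a complete set of mutually exclusive outcomes.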


Exercise 8.13 An experiment has only three outcomes and they are mutually 
exclusive. The outcomes occur with probabilities in the ratio 1 : 3 : 6. What are 
their probabilities?


8.4.3 Average values and expectation values 
Average values 


If a given quantity A, with a discrete set of possible values, $A_1, A_2, \ldots, A_n$, is
repeatedly measured in a given situation, we can record the number of times each
value occurs. If the measurement of A is repeated N times, and the value $A_i$ is
obtained on $N_i$ occasions, we define the average value or mean value of A to be

\[
\overline{A} = \frac{N_1 A_1 + N_2 A_2 + \cdots + N_n A_n}{N} = \frac{1}{N}\sum_{i=1}^{n} N_i A_i. \tag{8.51}
\]

We can also define the relative frequency $f_i$ of value $A_i$ to be the fraction of
times that this value occurs. We then have

\[
f_i = \frac{N_i}{N}, \qquad\text{where}\qquad \sum_{i=1}^{n} f_i = 1.
\]

In terms of the relative frequencies, the average value is given by

\[
\overline{A} = f_1 A_1 + f_2 A_2 + \cdots + f_n A_n = \sum_{i=1}^{n} f_i A_i. \tag{8.52}
\]


Average values provide a very convenient way of characterizing the results of 
repeated measurements. 


Expectation values 


When predicting the result of measuring a quantity A, we may know that a
number of different values, $A_1, A_2, \ldots, A_n$, are possible, with each value $A_i$ being
characterized by a probability $p_i$. The complete set of probabilities is said to be a
probability distribution for the measurement. For a set of discrete results, a
probability distribution may be represented by a bar chart, as in Figure 8.9.


Figure 8.9 A bar chart representing a discrete probability distribution (probability plotted against A).


Sometimes, we do not need to know the full detail of a probability distribution; it 
is sufficient to characterize the distribution by a value that can be compared 
easily with the average value of a set of measurements. We therefore define the 
expectation value of a probability distribution to be 


\[
\langle A \rangle = p_1 A_1 + p_2 A_2 + \cdots + p_n A_n = \sum_{i=1}^{n} p_i A_i, \tag{8.53}
\]


where the sum runs over all the possible values for A. 


The expectation value $\langle A \rangle$ is a theoretical prediction for the average value $\overline{A}$. You
should not be too surprised if, over a finite sequence of measurements, the
average value and the expectation value differ from one another slightly; random
fluctuations are in the nature of chance. However, as the number of measurements
increases, we expect the relative frequency $f_i$ to approach the probability $p_i$, so in
the long run, the average value is expected to approach the expectation value.

It is worth noting that the expectation value need not be any of the possible values
$A_1, A_2, \ldots, A_n$. For example, if A is the score rolled on a die, the expectation
value of A is

\[
\langle A \rangle = \frac{1}{6} + \frac{2}{6} + \frac{3}{6} + \frac{4}{6} + \frac{5}{6} + \frac{6}{6} = 3.5.
\]

This result could never be obtained on a single roll. Under some circumstances,
the expectation value need not be close to the most likely value. For the skew
distribution shown in Figure 8.10, the expectation value $\langle A \rangle$ is shifted far away
from the peak, into the long tail of the distribution. Nevertheless, the expectation
value represents our best estimate of the average value for a long sequence of
measurements of A.
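The relationship between the expectation value (Equation 8.53) and the average value (Equation 8.51) can be illustrated for the die. The sketch below is an illustration only: the simulated rolls, the number of rolls and the seed are assumptions, not data from the text.

```python
from fractions import Fraction
import random

def expectation_value(values, probabilities):
    """Sum of p_i * A_i over all possible values (Equation 8.53)."""
    return sum(p * a for a, p in zip(values, probabilities))

def average_value(samples):
    """Mean of a list of measured values (Equation 8.51)."""
    return sum(samples) / len(samples)

scores = [1, 2, 3, 4, 5, 6]
probs = [Fraction(1, 6)] * 6          # an unloaded die

expected = expectation_value(scores, probs)

# Simulated measurements: an assumed stand-in for real die rolls.
rng = random.Random(1)
rolls = [rng.choice(scores) for _ in range(60_000)]
average = average_value(rolls)

print(expected, average)
```

The expectation value is exactly 7/2, while the average of a finite run of rolls fluctuates around it, as described above.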


Figure 8.10 The expectation value $\langle A \rangle$ need not coincide with the peak (most
likely) value of the distribution.


8.4.4 Standard deviations and uncertainties 
Standard deviations 


It is often important to characterize the spread of values of a quantity around its 
average value. This is generally done by introducing a quantity called the standard 
deviation. If the quantity A has a discrete set of possible values, $A_i$, which occur
with relative frequencies $f_i$, we define the standard deviation of A to be

\[
\sigma(A) = \left[\sum_{i=1}^{n} f_i \left(A_i - \overline{A}\right)^2\right]^{1/2}.
\]

In words: we take the deviation $A_i - \overline{A}$ of each value from the average value,
square the results, find the average of these squared deviations, and finally take 
the square root. Taking the square root in the last step ensures that the standard 
deviation has the same units as A. (Note that it would be useless to consider the 
average value of the deviations without taking their square first, since this would 
always give zero, by definition of the average value.) 
Uncertainties


The spread of a probability distribution around its expectation value is 
characterized by a quantity called the uncertainty. This is defined in a similar way 
to the standard deviation, but using the theoretical probabilities rather than the 
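measured relative frequencies. For a quantity A, whose possible values $A_i$ have
probabilities $p_i$, the uncertainty in A is defined by

\[
\Delta A = \left[\sum_{i=1}^{n} p_i \left(A_i - \langle A \rangle\right)^2\right]^{1/2}
= \left[\left\langle \left(A - \langle A \rangle\right)^2 \right\rangle\right]^{1/2}. \tag{8.54}
\]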


From this definition, it is clear that uncertainty is a theoretical prediction for the 
standard deviation. The standard deviation is expected to approach the uncertainty 
in the long run, as the number of measurements increases. 


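An alternative way of calculating uncertainties is often used. Squaring both sides
of Equation 8.54 and expanding out $(A - \langle A \rangle)^2$, we see that

\begin{align*}
(\Delta A)^2 &= \left\langle (A - \langle A \rangle)^2 \right\rangle \\
&= \left\langle A^2 - 2A\langle A \rangle + \langle A \rangle^2 \right\rangle \\
&= \langle A^2 \rangle - 2\langle A \rangle\langle A \rangle + \langle A \rangle^2 \\
&= \langle A^2 \rangle - \langle A \rangle^2,
\end{align*}

so the uncertainty in A can also be expressed as

\[
\Delta A = \left[\langle A^2 \rangle - \langle A \rangle^2\right]^{1/2}. \tag{8.55}
\]

This formula generally provides the most efficient way of calculating uncertainties
in quantum mechanics.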
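The equivalence of Equations 8.54 and 8.55 can be checked for any discrete distribution. The following sketch uses an assumed three-valued distribution (values 0, 1, 2 with probabilities 1/2, 1/4, 1/4), chosen purely for illustration.

```python
import math

def expectation(values, probs, power=1):
    """Expectation value of A**power for a discrete distribution."""
    return sum(p * a**power for a, p in zip(values, probs))

def uncertainty_from_definition(values, probs):
    """Equation 8.54: root of the expected squared deviation from <A>."""
    mean = expectation(values, probs)
    return math.sqrt(sum(p * (a - mean) ** 2 for a, p in zip(values, probs)))

def uncertainty_from_moments(values, probs):
    """Equation 8.55: root of <A^2> - <A>^2."""
    return math.sqrt(expectation(values, probs, 2) - expectation(values, probs) ** 2)

# Assumed illustrative distribution (not taken from the text).
values = [0.0, 1.0, 2.0]
probs = [0.5, 0.25, 0.25]

delta_a_definition = uncertainty_from_definition(values, probs)
delta_a_moments = uncertainty_from_moments(values, probs)
print(delta_a_definition, delta_a_moments)
```

The two routes give the same number; the second needs only the two sums ⟨A⟩ and ⟨A²⟩, which is why it is usually the quicker one.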


We stated earlier that the average value $\overline{A}$ need not be exactly equal to the
expectation value, $\langle A \rangle$, but that the difference between these two quantities is
expected to become very small when the number of measurements becomes very
large. We can now quantify this statement. If N measurements are taken (with N
greater than about 25), it can be shown that the probability of finding an average
value $\overline{A}$ that deviates from the expectation value $\langle A \rangle$ by more than a fraction f of
the uncertainty $\Delta A$ is less than one in a million if $N > 25/f^2$. This is true no
matter what the shape of the probability distribution. This result is a consequence
of the central limit theorem in statistics.


Exercise 8.14 Calculate the standard deviation associated with rolling a die by
(a) using Equation 8.54 and (b) using Equation 8.55.


8.4.5 Continuous probability distributions 


So far, we have considered a discrete set of possible outcomes, which can be
labelled $A_1, A_2, \ldots$. However, we can also consider situations in which the set of
possible outcomes forms a continuum.


For simplicity, we shall consider the example of measuring the position x of a 
particle in one dimension, where x can take any value in a continuous range. In 
this case, it is appropriate to introduce a probability density function p(x), 
defined so that the probability of obtaining a value of x in a small range of width
dx, centred on x, is p(x) dx. Figure 8.11 shows an example of a probability
density function. The probability that x lies between a and b is given by the
integral

\[
\int_a^b p(x)\,\mathrm{d}x, \tag{8.56}
\]

which is the area of the shaded strip in Figure 8.11. The area under the whole
curve is then given by

\[
\int_{-\infty}^{\infty} p(x)\,\mathrm{d}x = 1. \tag{8.57}
\]


This is the normalization rule for a continuous set of outcomes. 


Figure 8.11 A continuous probability distribution p(x). 


In general, everything is similar to the discrete case, except that sums are replaced 
by integrals. For example, the expectation value of x is given by 


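\[
\langle x \rangle = \int_{-\infty}^{\infty} p(x)\,x\,\mathrm{d}x, \tag{8.58}
\]

and the uncertainty of x is given by

\[
\Delta x = \left[\langle x^2 \rangle - \langle x \rangle^2\right]^{1/2}, \tag{8.59}
\]

where

\[
\langle x^2 \rangle = \int_{-\infty}^{\infty} p(x)\,x^2\,\mathrm{d}x. \tag{8.60}
\]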


These results can easily be extended to three dimensions, using a probability 
density function that depends on three coordinates, x, y and z and integrating over 
all of these coordinates. 
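Equations 8.56 to 8.60 can be evaluated numerically for a specific density. The sketch below is illustrative only: the density p(x) = (2/L) sin²(πx/L) on 0 ≤ x ≤ L (zero elsewhere) and the midpoint-rule integrator are assumptions made for the example. Because this density vanishes outside the interval, integrating from 0 to L is equivalent to integrating over all x.

```python
import math

def integrate(func, a, b, n=20_000):
    """Midpoint-rule estimate of the integral of func from a to b."""
    width = (b - a) / n
    return width * sum(func(a + (i + 0.5) * width) for i in range(n))

L = 1.0  # assumed width of the region where the density is non-zero

def density(x):
    """Assumed example density: (2/L) sin^2(pi x / L) for 0 <= x <= L."""
    return (2.0 / L) * math.sin(math.pi * x / L) ** 2

norm = integrate(density, 0.0, L)                              # Eq. 8.57
prob_first_quarter = integrate(density, 0.0, L / 4)            # Eq. 8.56
mean_x = integrate(lambda x: density(x) * x, 0.0, L)           # Eq. 8.58
mean_x2 = integrate(lambda x: density(x) * x * x, 0.0, L)      # Eq. 8.60
delta_x = math.sqrt(mean_x2 - mean_x**2)                       # Eq. 8.59

print(norm, prob_first_quarter, mean_x, delta_x)
```

For this particular density the integrals can also be done by hand, giving ⟨x⟩ = L/2 and Δx = L[1/12 − 1/(2π²)]^{1/2}, which provides a check on the numerical values.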
Solutions to exercises


Ex 1.1 The gamma-ray photon has an energy
of $1.60 \times 10^{-13}$ J. From Equation 1.1, the
frequency of the photon is $f = E_{\text{photon}}/h$, so
$f = 1.60 \times 10^{-13}\ \text{J}/(6.63 \times 10^{-34}\ \text{J s}) = 2.41 \times 10^{20}$ Hz,
which is a frequency roughly $2.4 \times 10^{14}$ times that of a
1 MHz AM radio photon.


Ex 1.2 The poker would lose energy largely by 
emitting an enormous number of photons, some visible 
but mostly infrared. The energy of each photon would 
be so small compared to the energy of the poker that 
detecting anything but a continuous cooling down 
would be hard; yet, in principle, the cooling of the 
poker is discrete. 


Ex 1.3 The power produced is proportional
to dN/dt, which is proportional to N, the number
of radioactive nuclei present. Using Equation 1.7,
the power produced after 7.30 years would be
$\exp(-0.693 \times 7.30/87.7) = 0.944$ times the initial
power.


Ex 1.4 (a) The energy and the magnitude
of the momentum of a photon are respectively
hf and hf/c, so the magnitude of the
momentum is the energy divided by c, which is

\[
\frac{1 \times 10^{6} \times 1.60 \times 10^{-19}\ \text{J}}{3.00 \times 10^{8}\ \text{m s}^{-1}}
= 5.33 \times 10^{-22}\ \text{kg m s}^{-1}.
\]

(b) Force is the rate of change of momentum. Let p be
the magnitude of the momentum of each photon.
If the rate of absorption of photons is dN/dt, the
rate of transfer of momentum to the lead block is
p dN/dt; this is the force exerted on the lead, which is
1 N = 1 kg m s$^{-2}$. Hence,

\[
\frac{\mathrm{d}N}{\mathrm{d}t} = \frac{1\ \text{kg m s}^{-2}}{5.33 \times 10^{-22}\ \text{kg m s}^{-1}}
= 1.88 \times 10^{21}\ \text{s}^{-1}.
\]

(c) Using the results of Exercise 1.1 and noting that the
momentum of a photon is proportional to its frequency,
the number of AM radio photons required per second is
$2.4 \times 10^{14}$ times greater than the answer to part (b), i.e.
$4.5 \times 10^{35}\ \text{s}^{-1}$. An AM radio photon carries much
less momentum, as well as much less energy, than a
gamma-ray photon.
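The arithmetic in Exercises 1.1 and 1.4 can be reproduced with a few lines of code. This is a sketch using the rounded constants quoted in the solutions; the variable names are illustrative.

```python
# Rounded constants as used in the solutions above.
PLANCK = 6.63e-34        # Planck's constant, J s
LIGHT_SPEED = 3.00e8     # speed of light, m/s

photon_energy = 1.60e-13             # gamma-ray photon energy, J (Ex 1.1)
radio_frequency = 1.0e6              # 1 MHz AM radio photon, Hz

frequency = photon_energy / PLANCK           # Ex 1.1: f = E/h
momentum = photon_energy / LIGHT_SPEED       # Ex 1.4(a): p = E/c

# Ex 1.4(b): photons per second needed to exert a force of 1 N.
force = 1.0                                  # N
gamma_rate = force / momentum

# Ex 1.4(c): radio photons carry less momentum in proportion to frequency.
radio_rate = gamma_rate * frequency / radio_frequency

print(frequency, momentum, gamma_rate, radio_rate)
```

The printed values can be compared directly with the figures quoted in the solutions.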


Ex 1.5 The photon is described by a wave that goes 
both ways through the half-silvered mirror until it 
manifests itself at one or the other of the two detectors. 
This is similar to the way that each photon must
progress as a wave through the entire apparatus of 
Taylor until it manifests itself at one location on the 
photographic plate. 


Ex 1.6 The electron would be most likely to be found 
in region B, least likely in region C, and somewhere in 
between in region A. Since $|\Psi(x,t)|^2$ has moved as a
whole to the left, we conclude that it is describing an 
electron moving in the direction of decreasing x. But 
see the next exercise! 


Ex 1.7 No! The regions where $|\Psi(x,t)|^2$ is
substantially non-zero overlap at the two instants of 
time. In particular, there is a significant chance of 
finding the electron in region C at the later time. 


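Ex 1.8 The probability of being found in a
small interval $\delta x$, centred on position x, is
$|\Psi_{\text{dB}}(x,t)|^2\,\delta x$. Let us use the symbol $\phi$ to stand
for the phase of the wave, so $\phi = kx - \omega t$. Then

\[
|\Psi_{\text{dB}}|^2 = |A|^2\,|\mathrm{e}^{\mathrm{i}\phi}|^2 = |A|^2\,\mathrm{e}^{-\mathrm{i}\phi}\,\mathrm{e}^{+\mathrm{i}\phi} = |A|^2\,\mathrm{e}^{0} = |A|^2.
\]

So the probability of finding the particle in the small
interval $\delta x$ is independent of x.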


Ex 1.9 The function q(x) is never zero, being unity
for all x since $\sin^2 x + \cos^2 x = 1$ for all x. The
function p(x) is equal to zero when $\sin x + \cos x = 0$.
This happens when $\sin x = -\cos x$, that is when
$\tan x = -1$, which is satisfied by $x = 3\pi/4 + n\pi$,
where n is any integer.


Ex 1.10 Blocking path P1 (or P2, for that matter) 
would lead to photons being counted at Db. The 
probability of detecting a photon is $p_b = |b_2|^2$. (This
would be 0.25 since a photon has probability 0.5 

of reaching H2 and any photon that does so has a 
probability 0.5 of being detected at Db.) 


Ex 1.11 For photons: if only one path is open, only
one probability amplitude will contribute, and this is 
never zero, so an incident photon can appear at either 
output. But if both paths are open, the probability 
amplitude for a photon to appear at one of the outputs 
will be a sum of two probability amplitudes and this 
sum will be zero if the two probability amplitudes have 
equal moduli and opposite phases. 


For a classical wave: if the path lengths are appropriate, 
the wave that reaches the detector by one path can 
exactly cancel the wave that arrives by the other path. 


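Ex 2.1 (a) In this case:

\begin{align*}
\widehat{O}_1\, g(x,t) &= \frac{\partial}{\partial x}\left(3x^2t^3\right) = 6xt^3, \\
\widehat{O}_2\, g(x,t) &= \left(3\frac{\partial}{\partial x} + 3x^2\right)\left(3x^2t^3\right) = 18xt^3 + 9x^4t^3, \\
\widehat{O}_3\, g(x,t) &= \left(\frac{\partial^2}{\partial x^2} + 5\right)\left(3x^2t^3\right) = 6t^3 + 15x^2t^3.
\end{align*}

(b) In this case:

\begin{align*}
\widehat{O}_1\, h(x,t) &= \frac{\partial}{\partial x}\left[a\sin(kx - \omega t)\right] = ka\cos(kx - \omega t), \\
\widehat{O}_2\, h(x,t) &= \left(3\frac{\partial}{\partial x} + 3x^2\right)\left[a\sin(kx - \omega t)\right] \\
&= 3ka\cos(kx - \omega t) + 3x^2 a\sin(kx - \omega t), \\
\widehat{O}_3\, h(x,t) &= \left(\frac{\partial^2}{\partial x^2} + 5\right)\left[a\sin(kx - \omega t)\right] \\
&= -k^2 a\sin(kx - \omega t) + 5a\sin(kx - \omega t) \\
&= (5 - k^2)\,a\sin(kx - \omega t).
\end{align*}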


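Ex 2.2 (a) $\widehat{Q}$ is not linear because

\[
\widehat{Q}(\alpha f + \beta g) = \log(\alpha f + \beta g) \neq \alpha\log f + \beta\log g.
\]

(b) $\widehat{R}$ is linear because

\[
\widehat{R}(\alpha f + \beta g) = \frac{\mathrm{d}^2}{\mathrm{d}x^2}(\alpha f + \beta g)
= \alpha\,\widehat{R} f + \beta\,\widehat{R} g.
\]

(c) $\widehat{S}$ is not linear because

\[
\widehat{S}(\alpha f + \beta g) = (\alpha f + \beta g)^2 \neq \alpha f^2 + \beta g^2.
\]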




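Ex 2.3 Letting the given operators act on the given
function $\mathrm{e}^{\mathrm{i}\alpha x}$, we obtain

\[
\frac{\mathrm{d}}{\mathrm{d}x}\left(\mathrm{e}^{\mathrm{i}\alpha x}\right) = \mathrm{i}\alpha\,\mathrm{e}^{\mathrm{i}\alpha x}
\]

and

\[
\frac{\mathrm{d}^2}{\mathrm{d}x^2}\left(\mathrm{e}^{\mathrm{i}\alpha x}\right) = (\mathrm{i}\alpha)^2\,\mathrm{e}^{\mathrm{i}\alpha x} = -\alpha^2\,\mathrm{e}^{\mathrm{i}\alpha x}.
\]

So $\mathrm{e}^{\mathrm{i}\alpha x}$ is an eigenfunction of both operators, with
eigenvalues $\mathrm{i}\alpha$ and $-\alpha^2$ respectively.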


Ex 2.4 Operating with $\widehat{p}_x$ on the given wave function,
we obtain

\[
\widehat{p}_x \Psi(x,t) = -\mathrm{i}\hbar\,\frac{\partial}{\partial x}\left(A\,\mathrm{e}^{\mathrm{i}(-kx - \omega t)}\right) = -\hbar k\,\Psi(x,t).
\]

So, $\Psi(x,t)$ is an eigenfunction of the momentum
operator, with eigenvalue $-\hbar k$. We interpret this by
saying that the momentum of the particle described by
the wave is $p_x = -\hbar k$. The negative sign makes sense
because the wave $A\,\mathrm{e}^{\mathrm{i}(-kx - \omega t)}$ propagates in the
negative x-direction.


Ex 2.5 A free particle is one that is not subject to
forces, so we can put V(x) = 0. The Hamiltonian
function is then $H = p_x^2/2m$.


Ex 2.6 With $V(x) = \tfrac{1}{2}Cx^2$, we get
$F_x = -\partial V/\partial x = -Cx$, as required. Adding this
potential energy to the kinetic energy expressed in
terms of momentum, the Hamiltonian function is

\[
H = \frac{p_x^2}{2m} + \tfrac{1}{2}Cx^2.
\]

Ex 2.7 The Hamiltonian function for this system was
obtained in Exercise 2.6. Carrying out the standard
replacement of Equation 2.24, the corresponding
Hamiltonian operator is

\[
\widehat{H} = -\frac{\hbar^2}{2m}\,\frac{\partial^2}{\partial x^2} + \tfrac{1}{2}Cx^2.
\]


Ex 2.8 (a) With no potential energy, we have

\[
\mathrm{i}\hbar\,\frac{\partial \Psi(x,t)}{\partial t} = -\frac{\hbar^2}{2m}\,\frac{\partial^2 \Psi(x,t)}{\partial x^2}.
\]

This is exactly Equation 2.7, as you would expect.

(b) Since the particle is subject to Hooke’s law, we
must substitute the one-dimensional harmonic oscillator
potential energy function $V(x) = \tfrac{1}{2}Cx^2$ into
Equation 2.19. Doing this gives

\[
\mathrm{i}\hbar\,\frac{\partial \Psi(x,t)}{\partial t} = -\frac{\hbar^2}{2m}\,\frac{\partial^2 \Psi(x,t)}{\partial x^2} + \tfrac{1}{2}Cx^2\,\Psi(x,t).
\]


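Ex 2.9 Using the properties of partial differentiation,
we have

\[
\frac{\partial}{\partial t}\left[a\Psi_1 + b\Psi_2\right] = a\,\frac{\partial \Psi_1}{\partial t} + b\,\frac{\partial \Psi_2}{\partial t}
\]

and

\[
\frac{\partial^2}{\partial x^2}\left[a\Psi_1 + b\Psi_2\right] = a\,\frac{\partial^2 \Psi_1}{\partial x^2} + b\,\frac{\partial^2 \Psi_2}{\partial x^2}.
\]

In addition, it is clear that

\[
V(x)\left[a\Psi_1 + b\Psi_2\right] = aV(x)\Psi_1 + bV(x)\Psi_2.
\]

Combining all these results, we have

\begin{align*}
&\left[-\frac{\hbar^2}{2m}\,\frac{\partial^2}{\partial x^2} + V(x) - \mathrm{i}\hbar\,\frac{\partial}{\partial t}\right]\left[a\Psi_1 + b\Psi_2\right] \\
&\quad = a\left[-\frac{\hbar^2}{2m}\,\frac{\partial^2 \Psi_1}{\partial x^2} + V(x)\Psi_1 - \mathrm{i}\hbar\,\frac{\partial \Psi_1}{\partial t}\right]
+ b\left[-\frac{\hbar^2}{2m}\,\frac{\partial^2 \Psi_2}{\partial x^2} + V(x)\Psi_2 - \mathrm{i}\hbar\,\frac{\partial \Psi_2}{\partial t}\right],
\end{align*}

which is equal to zero because both $\Psi_1$ and $\Psi_2$ satisfy
Schrödinger’s equation. This shows that $a\Psi_1 + b\Psi_2$
also satisfies Schrödinger’s equation, as required.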


Ex 2.10 Not quite. Any linear combination of 
normalized wave functions satisfies Schrödinger’s
equation, but to represent a state, the linear combination 
must itself be normalized. This can always be achieved 
by multiplying the whole linear combination by a 
suitable normalization constant. 


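Ex 2.11 Substituting $T(t) = \mathrm{e}^{-\mathrm{i}Et/\hbar}$ into the
left-hand side of Equation 2.33 and performing the
differentiation gives $\mathrm{i}\hbar(-\mathrm{i}E/\hbar)\,\mathrm{e}^{-\mathrm{i}Et/\hbar} = E\,\mathrm{e}^{-\mathrm{i}Et/\hbar}$,
which is equal to ET(t) as required.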


Ex 3.1 Equation 3.1 would be replaced by V(x) = 0
for −L/2 < x < L/2 and Equation 3.2 would be
replaced by V(x) = ∞ for x < −L/2 and x > L/2.


Ex 3.2 An energy eigenfunction $\psi(x)$ does not
depend on time. It satisfies the time-independent
Schrödinger equation, which is the eigenvalue equation
for energy. A wave function $\Psi(x,t)$ depends on time as
well as position. It satisfies Schrödinger’s equation and
provides the most complete possible description of the
state of a system at all times.

Comment: A stationary-state wave function,
$\Psi_n(x,t) = \psi_n(x)\,\mathrm{e}^{-\mathrm{i}E_n t/\hbar}$, is an energy eigenfunction
at any fixed time. For example, at t = 0, we have
$\Psi_n(x,0) = \psi_n(x)$. However, an energy eigenfunction
should never be called a wave function because it
does not contain the necessary information about
time-dependence.

The distinction between wave functions and energy
eigenfunctions is maintained in our notation: upper case
$\Psi$ is used for wave functions and lower case $\psi$ is used
for energy eigenfunctions.


Ex 3.3 Substituting the suggested form for the
eigenfunction $\psi(x)$ into both sides of Equation 3.7 and
carrying out the differentiations gives

\[
\frac{\hbar^2 k^2}{2m}\left[A\sin(kx) + B\cos(kx)\right] = E\left[A\sin(kx) + B\cos(kx)\right]
\]

for 0 < x < L, which will be true if $E = \hbar^2k^2/2m$.
This last condition is met provided that Equation 3.11 is
satisfied.


Ex 3.4 If we replace k by −k in the general
solution of Equation 3.10, and use the relations
$\sin(-kx) = -\sin(kx)$ and $\cos(-kx) = \cos(kx)$, we
see that

\begin{align*}
\psi(x) &= A\sin(-kx) + B\cos(-kx) \\
&= -A\sin(kx) + B\cos(kx)
\end{align*}

for 0 < x < L.

So, from a mathematical point of view, using
$k = -\sqrt{2mE}/\hbar$ instead of $k = +\sqrt{2mE}/\hbar$ is
equivalent to reversing the sign of A. So far, we have
placed no restrictions on the arbitrary constants A and
B, which could be any complex numbers. Hence there
is nothing to be learned by considering negative values
of k that will not emerge from studying positive values
alone.


Ex 3.5 From Equation 3.16, we have

\[
E_n = \frac{n^2\pi^2\hbar^2}{2mL^2} \qquad\text{and}\qquad E_{n+1} = \frac{(n+1)^2\pi^2\hbar^2}{2mL^2}.
\]

So the energy difference between two neighbouring
energy levels with quantum numbers n + 1 and n is

\[
E_{n+1} - E_n = \frac{\left[(n+1)^2 - n^2\right]\pi^2\hbar^2}{2mL^2} = \frac{\left[2n+1\right]\pi^2\hbar^2}{2mL^2}.
\]

This energy difference tends to zero as L tends to
infinity, as claimed.


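Ex 3.6 For the region inside the box, Equations 3.26
and 3.27 give

\[
\left[\operatorname{Re}(\Psi_n(x,t))\right]^2 = \frac{2}{L}\,\sin^2\!\left(\frac{n\pi x}{L}\right)\cos^2\!\left(\frac{E_n t}{\hbar}\right),
\]

\[
\left[\operatorname{Im}(\Psi_n(x,t))\right]^2 = \frac{2}{L}\,\sin^2\!\left(\frac{n\pi x}{L}\right)\sin^2\!\left(\frac{E_n t}{\hbar}\right).
\]

Adding the corresponding sides of these equations and
remembering that $\cos^2(E_n t/\hbar) + \sin^2(E_n t/\hbar) = 1$
gives

\[
|\Psi_n(x,t)|^2 = \frac{2}{L}\,\sin^2\!\left(\frac{n\pi x}{L}\right),
\]

which is independent of time, as required.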


Ex 3.7 The coordinate x that appears in the argument
of $\Psi_n(x,t)$ is used to label positions at which the
particle might be found. If the position of the particle is
measured, then $|\Psi_n(x,t)|^2\,\delta x$ is the probability of
finding the particle in a small interval $\delta x$, centred on the
position x, at time t.


Ex 3.8 Since $\mathrm{e}^{-\mathrm{i}(E_n - V_0)t/\hbar} = \mathrm{e}^{-\mathrm{i}E_n t/\hbar}\,\mathrm{e}^{\mathrm{i}V_0 t/\hbar}$, the
new wave functions are the same as the original ones,
apart from a time-dependent phase factor $\mathrm{e}^{\mathrm{i}V_0 t/\hbar}$.

It is always possible to multiply a wave function
$\Psi(x,t)$ by an arbitrary phase factor without making
any change to the predictions made using the wave
function. For example, the probability of finding the
particle in a small interval $\delta x$, centred on x, is given by
$|\Psi(x,t)|^2\,\delta x$, and this is unchanged by multiplying
$\Psi(x,t)$ by a phase factor. The modified stationary-state
wave functions therefore describe the same behaviour
as the original ones.


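Ex 3.9 Dropping the indices $n_x$ and $n_y$ to simplify the
notation, a stationary-state wave function takes the form
$\Psi(x,y,t) = \psi(x,y)\,\mathrm{e}^{-\mathrm{i}Et/\hbar}$. The corresponding
probability density is

\begin{align*}
|\Psi(x,y,t)|^2 &= \left|\psi(x,y)\,\mathrm{e}^{-\mathrm{i}Et/\hbar}\right|^2 \\
&= |\psi(x,y)|^2\,\left|\mathrm{e}^{-\mathrm{i}Et/\hbar}\right|^2 \\
&= |\psi(x,y)|^2,
\end{align*}

which is independent of time.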


Ex 3.10 The energy of the $n_x = 7$, $n_y = 4$ state is
given by

\[
E_{n_x,n_y} = \frac{(n_x^2 + n_y^2)\,\pi^2\hbar^2}{2mL^2} = \frac{65\,\pi^2\hbar^2}{2mL^2}.
\]

To identify other states that are degenerate with this, we
must find all the other pairs of positive integers $(n_x, n_y)$
such that $n_x^2 + n_y^2 = 65$. The only possibilities are (4,7),
(8,1) and (1,8). The energy level that corresponds to the
(7,4) state is therefore four-fold degenerate.
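The search for degenerate states can be carried out by brute force. The sketch below is illustrative (the function name is an assumption): it lists every pair of positive integers whose squares sum to a given target.

```python
import math

def states_with_sum_of_squares(target):
    """All pairs of positive integers (nx, ny) with nx**2 + ny**2 == target."""
    limit = math.isqrt(target)   # neither quantum number can exceed this
    return [
        (nx, ny)
        for nx in range(1, limit + 1)
        for ny in range(1, limit + 1)
        if nx * nx + ny * ny == target
    ]

degenerate_states = states_with_sum_of_squares(7**2 + 4**2)
print(degenerate_states, len(degenerate_states))
```

The same function can be reused for other levels; for instance, a target of 2 returns only the single state (1, 1), which is non-degenerate.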


Ex 3.11 (a) From Equation 3.47, the lowest
energy level and next-to-lowest energy level have
energies $E_{1,1,1} = (1 + 1 + 1)\pi^2\hbar^2/2mL^2$ and
$E_{1,1,2} = (1 + 1 + 4)\pi^2\hbar^2/2mL^2$. It follows that the
difference in energy between these two levels is given
by $3\pi^2\hbar^2/2mL^2$.

(b) The lowest energy level is not degenerate; it
corresponds to the unique state with the quantum
numbers (1,1,1). The next-to-lowest energy level is
triply degenerate since it corresponds to three different
states (1,1,2), (1,2,1) and (2,1,1).


Ex3.12 According to Equations 3.24, 3.46, and 3.49, 
stationary-state wave functions have acceptable SI 
units of $\text{m}^{-1/2}$, $\text{m}^{-1}$ and $\text{m}^{-3/2}$ in one, two and
three dimensions. Born’s rule then shows that the
corresponding probability densities, $|\Psi|^2$, have SI units
of $\text{m}^{-1}$, $\text{m}^{-2}$ and $\text{m}^{-3}$. This is appropriate because a
probability density describes a probability per unit 
length in one dimension, a probability per unit area in 
two dimensions and a probability per unit volume in 
three dimensions. It means that integrals of probability 
density over lines, areas and volumes in one, two and 
three dimensions are equal to (unitless) probabilities. 


Ex 3.13 The relevant time-independent Schrödinger
equation is given in Equation 3.55. Substituting the
given form of $\psi(x)$ into this equation and performing
the necessary differentiations gives

\[
-\frac{\hbar^2\alpha^2}{2m}\left[A\,\mathrm{e}^{\alpha x} + B\,\mathrm{e}^{-\alpha x}\right] = E\left[A\,\mathrm{e}^{\alpha x} + B\,\mathrm{e}^{-\alpha x}\right].
\]

This equation is satisfied provided that
$\alpha^2 = -2mE/\hbar^2$. Since we are considering cases in
which E is negative, the condition on $\alpha^2$ is satisfied if
$\alpha = \sqrt{2m(-E)}/\hbar$. Thus, $\psi(x)$ is a solution to the
time-independent Schrödinger equation; moreover,
it is the general solution because it contains two
arbitrary constants, A and B, which is appropriate for a
second-order differential equation.


Ex 4.1 The Hamiltonian operator involves
differentiation with respect to $x$, but does not involve
any differentiation with respect to time. When this
operator is applied to a stationary-state wave function
$\Psi_n(x,t) = \psi_n(x)\,e^{-iE_nt/\hbar}$, it does not affect the
exponential phase factor, so we have

$$
\begin{aligned}
\hat{H}\Psi_n(x,t) &= \left(\hat{H}\psi_n(x)\right) e^{-iE_nt/\hbar} \\
&= \left(E_n\psi_n(x)\right) e^{-iE_nt/\hbar} \\
&= E_n\Psi_n(x,t).
\end{aligned}
$$

The stationary-state wave function satisfies the
time-independent Schrödinger equation, and is
therefore an energy eigenfunction with eigenvalue $E_n$.
This is true at any time $t$, so if the energy of the system
is measured, the value $E_n$ will be obtained no matter
what the time of the measurement.


Ex 4.2 There is no possibility of obtaining this energy 
because it is not one of the energy eigenvalues of a 
particle in a one-dimensional infinite square well. 


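Ex 4.3 The proof is essentially the same as that given
in Exercise 2.9 of Chapter 2. We have

$$
\begin{aligned}
\hat{H}\Psi - i\hbar\frac{\partial\Psi}{\partial t}
&= \left(\hat{H} - i\hbar\frac{\partial}{\partial t}\right)(a_1\Psi_1 + a_2\Psi_2 + \cdots) \\
&= a_1\left(\hat{H}\Psi_1 - i\hbar\frac{\partial\Psi_1}{\partial t}\right)
 + a_2\left(\hat{H}\Psi_2 - i\hbar\frac{\partial\Psi_2}{\partial t}\right) + \cdots,
\end{aligned}
$$

where the last step follows because the Hamiltonian $\hat{H}$
and $i\hbar\,\partial/\partial t$ are both linear operators. In the final
expression, each of the terms in brackets vanishes
because the stationary-state wave functions $\Psi_i$
separately satisfy Schrödinger’s equation. Hence $\Psi$ also
satisfies Schrödinger’s equation.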


Ex 4.4 We need to consider the integral

$$I_{nm} = \int_0^L \psi_n^*(x)\,\psi_m(x)\,dx = \frac{2}{L}\int_0^L \sin\left(\frac{n\pi x}{L}\right)\sin\left(\frac{m\pi x}{L}\right)dx.$$

The integral can be evaluated by changing the variable
of integration to $u = \pi x/L$. We have $dx = (L/\pi)\,du$,
and the limits of integration become $u = 0$ and $u = \pi$,
so

$$I_{nm} = \frac{2}{L}\times\frac{L}{\pi}\int_0^\pi \sin(nu)\sin(mu)\,du.$$

Finally, using a standard integral given inside the back
cover of the book, we conclude that

$$I_{nm} = \frac{2}{L}\times\frac{L}{\pi}\times\frac{\pi}{2}\,\delta_{nm} = \delta_{nm}$$

for all positive integers $n$ and $m$, which confirms that
the eigenfunctions are orthonormal.
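
The orthonormality result can also be checked numerically. This is a minimal sketch, assuming the well runs from $x = 0$ to $x = L$ with eigenfunctions $\sqrt{2/L}\,\sin(n\pi x/L)$ as in the integral above; the helper names `psi` and `overlap` and the choice of a midpoint rule are illustrative assumptions, not part of the original solution.

```python
import math


def psi(n, x, L):
    """Energy eigenfunction of an infinite square well occupying 0 <= x <= L."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)


def overlap(n, m, L=1.0, steps=2000):
    """Midpoint-rule estimate of the overlap integral of psi_n and psi_m."""
    h = L / steps
    return h * sum(psi(n, (j + 0.5) * h, L) * psi(m, (j + 0.5) * h, L)
                   for j in range(steps))


same = overlap(2, 2)       # should be close to 1
different = overlap(1, 3)  # should be close to 0
print(same, different)
```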


Ex 4.5 Neither the normalization condition nor
the probabilities depend on time. We can therefore
save some effort by performing the calculation at
$t = 0$, when the time-dependent phase factors are both
equal to 1. The normalization condition then gives
$1 = |2A|^2 + |-3A|^2 = 13|A|^2$, so we can choose
$A = 1/\sqrt{13}$. The probabilities of getting energies
$E_1$ and $E_2$ are then $p_1 = |2/\sqrt{13}|^2 = 4/13$ and
$p_2 = |-3/\sqrt{13}|^2 = 9/13$. As a final check, note that
the sum of these two probabilities is equal to 1, as
expected.


Ex 4.6 (a) Let

$$I = \int_{-a}^{a} f(x)\,dx,$$

where $f(x)$ is an odd function. If we change the
variable to $y = -x$, the limits of integration reverse
sign and $dx = -dy$, so the integral becomes

$$I = \int_{a}^{-a} f(-y)\,(-dy).$$

Swapping the limits of integration, and remembering
that $f(-y) = -f(y)$, we then obtain

$$I = (-1)^3\int_{-a}^{a} f(y)\,dy = -I,$$

which is possible only if $I = 0$.


Comment: Figure S4.1 is a graphical interpretation of 
this result. It is clear that each contribution to the 
integral from a small interval with x > 0 is cancelled by 
a contribution of equal magnitude and opposite sign 
from a similar interval with x < 0. 


Figure S4.1 The definite integral of an odd function
$f(x)$, taken over a range that is centred on $x = 0$, is
equal to zero.


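(b) The probability amplitude for getting energy $E_2$ in
the given state is

$$
\begin{aligned}
c_2(0) &= \int_{-\infty}^{\infty} \psi_2^*(x)\,\Psi(x,0)\,dx \\
&= \sqrt{\frac{2}{L}}\sqrt{\frac{30}{L^5}}\int_{-L/2}^{L/2} \sin\left(\frac{2\pi x}{L}\right)\left(\frac{L^2}{4} - x^2\right)dx.
\end{aligned}
$$

This is equal to zero because the integrand is an odd
function and the range of integration is centred on the
origin, $x = 0$. The probability of getting energy $E_2$ is
therefore equal to zero.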


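Ex 4.7 The probability amplitude for energy $E_3$ is

$$
\begin{aligned}
c_3(0) &= \int_{-\infty}^{\infty} \psi_3^*(x)\,\Psi(x,0)\,dx \\
&= \sqrt{\frac{2}{L}}\sqrt{\frac{30}{L^5}}\int_{-L/2}^{L/2} \cos\left(\frac{3\pi x}{L}\right)\left(\frac{L^2}{4} - x^2\right)dx.
\end{aligned}
$$

Following the method given in Worked Example 4.1,
we change the variable of integration to $y = 3\pi x/L$.
This gives

$$c_3(0) = \frac{\sqrt{60}}{L^3}\left[-\left(\frac{L}{3\pi}\right)^3\int_{-3\pi/2}^{3\pi/2} y^2\cos y\,dy + \frac{L^2}{4}\times\frac{L}{3\pi}\int_{-3\pi/2}^{3\pi/2}\cos y\,dy\right].$$

Then, using a standard integral given inside the back
cover of the book,

$$c_3(0) = \frac{\sqrt{60}}{L^3}\left[\left(\frac{L}{3\pi}\right)^3\frac{9\pi^2 - 8}{2} - \frac{2L^3}{12\pi}\right] = -\frac{4\sqrt{60}}{(3\pi)^3},$$

so the probability of energy $E_3$ is

$$p_3 = |c_3(0)|^2 = \frac{960}{(3\pi)^6} = 1.37\times 10^{-3}.$$

Comment: Just over one measurement in a thousand
gives the energy $E_3$. Energies like $E_5$ and $E_7$ are also
possible, but even less likely.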
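
The value of $p_3$ can be cross-checked by evaluating the overlap integral numerically. The sketch below assumes that the initial state is $\Psi(x,0) = \sqrt{30/L^5}\,(L^2/4 - x^2)$ inside a well centred on the origin (the form suggested by the factor $30/L^5$ in Exercise 4.10) and that $\psi_3(x) = \sqrt{2/L}\,\cos(3\pi x/L)$; the function name `c3` and the midpoint rule are illustrative choices.

```python
import math


def c3(L=1.0, steps=20000):
    """Midpoint-rule estimate of the overlap of psi_3 with the initial state."""
    h = L / steps
    total = 0.0
    for j in range(steps):
        x = -L / 2 + (j + 0.5) * h
        psi3 = math.sqrt(2.0 / L) * math.cos(3 * math.pi * x / L)
        initial = math.sqrt(30.0 / L ** 5) * (L ** 2 / 4 - x ** 2)
        total += psi3 * initial * h
    return total


p3_numeric = c3() ** 2                # probability from numerical integration
p3_exact = 960 / (3 * math.pi) ** 6   # closed form quoted in the solution
print(p3_numeric, p3_exact)
```

The two numbers should agree closely, and neither depends on the value chosen for `L`.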


Ex 4.8 According to the solution to Exercise 4.5, the
probabilities of energies $E_1$ and $E_2$ are $p_1 = 4/13$ and
$p_2 = 9/13$, and the probabilities of all other energies
are equal to zero. Hence the expectation value of the
energy is $\langle E\rangle = (4E_1 + 9E_2)/13$.


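Ex 4.9 The wave function is the same as that in
Worked Example 4.2. The time-dependent phase factors
cancel out as before, so for $\langle p_x\rangle$ we are left with

$$
\begin{aligned}
\langle p_x\rangle &= \frac{2}{L}\int_{-L/2}^{L/2} \cos\left(\frac{\pi x}{L}\right)\left(-i\hbar\frac{\partial}{\partial x}\right)\cos\left(\frac{\pi x}{L}\right)dx \\
&= \frac{i\hbar\pi}{L}\times\frac{2}{L}\int_{-L/2}^{L/2} \cos\left(\frac{\pi x}{L}\right)\sin\left(\frac{\pi x}{L}\right)dx.
\end{aligned}
$$

This integral vanishes because the integrand is an odd
function of $x$ and the range of integration is centred on
the origin. Hence

$$\langle p_x\rangle = 0.$$

The expectation value of $p_x^2$ is given by

$$
\begin{aligned}
\langle p_x^2\rangle &= \frac{2}{L}\int_{-L/2}^{L/2} \cos\left(\frac{\pi x}{L}\right)\left(-i\hbar\frac{\partial}{\partial x}\right)^2\cos\left(\frac{\pi x}{L}\right)dx \\
&= \frac{\pi^2\hbar^2}{L^2}\times\frac{2}{L}\int_{-L/2}^{L/2} \cos^2\left(\frac{\pi x}{L}\right)dx.
\end{aligned}
$$

The remaining integral is easily evaluated, but
direct calculation can be avoided by noting that
$(2/L)\int_{-L/2}^{L/2}\cos^2(\pi x/L)\,dx$ is the normalization
integral for the wave function, and so must be equal
to 1. Hence

$$\langle p_x^2\rangle = \frac{\pi^2\hbar^2}{L^2}.$$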

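Ex 4.10 The expectation value of the energy is given
by

$$
\begin{aligned}
\langle E\rangle &= \int_{-\infty}^{\infty} \Psi^*(x,0)\,\hat{H}\,\Psi(x,0)\,dx \\
&= \frac{30}{L^5}\int_{-L/2}^{L/2}\left(\frac{L^2}{4} - x^2\right)\left(-\frac{\hbar^2}{2m}\frac{d^2}{dx^2}\right)\left(\frac{L^2}{4} - x^2\right)dx,
\end{aligned}
$$

where we have used the fact that the wave function
vanishes outside the well. Evaluating the second
derivative, we obtain

$$
\begin{aligned}
\langle E\rangle &= \frac{30}{L^5}\int_{-L/2}^{L/2}\left(\frac{L^2}{4} - x^2\right)\left(-\frac{\hbar^2}{2m}\right)(-2)\,dx \\
&= \frac{30\hbar^2}{mL^5}\left[\frac{L^2x}{4} - \frac{x^3}{3}\right]_{-L/2}^{L/2} \\
&= \frac{30\hbar^2}{mL^5}\times 2\times\left(\frac{L^3}{8} - \frac{L^3}{3\times 8}\right) \\
&= \frac{5\hbar^2}{mL^2}.
\end{aligned}
$$

Comment: The expectation value is slightly greater
than the ground-state energy, $\pi^2\hbar^2/2mL^2$. This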

agrees with the results of Worked Example 4.1 and 
Exercises 4.6 and 4.7, which showed that there is a very 
high probability of obtaining the ground-state energy in 
this state, and much smaller probabilities of obtaining 
higher energies. No matter what the state of the system, 
the expectation value of the energy is always greater 
than or equal to the ground-state energy. 


Ex 4.11 Any measurement of energy in the $i$th
stationary state is certain to yield the corresponding
energy eigenvalue $E_i$. Consequently, $\langle E\rangle = E_i$ and
$\langle E^2\rangle = E_i^2$. The uncertainty in energy is therefore

$$\Delta E = \sqrt{\langle E^2\rangle - \langle E\rangle^2} = \sqrt{E_i^2 - E_i^2} = 0.$$


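Ex 4.12 The square of the uncertainty in energy is

$$(\Delta E)^2 = \langle E^2\rangle - \langle E\rangle^2.$$

The probabilities of getting energies $E_1$ and $E_2$ in the
given state are each equal to 1/2, so

$$\langle E\rangle = \tfrac{1}{2}E_1 + \tfrac{1}{2}E_2 \quad\text{and}\quad \langle E^2\rangle = \tfrac{1}{2}E_1^2 + \tfrac{1}{2}E_2^2.$$

Hence

$$
\begin{aligned}
(\Delta E)^2 &= \tfrac{1}{2}(E_1^2 + E_2^2) - \tfrac{1}{4}(E_1 + E_2)^2 \\
&= \tfrac{1}{4}(E_1^2 + E_2^2 - 2E_1E_2) \\
&= \tfrac{1}{4}(E_2 - E_1)^2,
\end{aligned}
$$

so

$$\Delta E = \tfrac{1}{2}(E_2 - E_1).$$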


Ex 4.13 In the absence of thermal energy, the kinetic 
energy is entirely due to quantum effects (arising from 
the uncertainty principle). Using Equation 4.37, the 
average kinetic energy of a helium atom has a minimum
value of order

$$\frac{23\hbar^2}{2ML^2} = \frac{23\times(1.06\times 10^{-34}\ \text{J s})^2}{2\times 6.7\times 10^{-27}\ \text{kg}\times(3.5\times 10^{-10}\ \text{m})^2} = 1.6\times 10^{-22}\ \text{J}.$$
Comment: This is roughly equal to the average 


thermal energy of an atom at a temperature of 8 K. 
At temperatures well below this, liquid helium is 
dominated by quantum effects, and it is called a 
quantum liquid. 


Ex 4.14 As suggested in the question, we take 

Ax = s, where s is the typical electron—proton 
separation. The uncertainty principle then shows that a 
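typical momentum component in the $x$-direction obeys
$p_x \sim \Delta p_x \geq \hbar/2s$, with similar results applying in
the other two directions. Consequently, in quantum
mechanics, the typical energy obeys the inequality

$$E \geq \frac{3\hbar^2}{8ms^2} - \frac{e^2}{4\pi\varepsilon_0 s}.$$

Differentiating with respect to $s$ to find the minimum of
the right-hand side gives

$$-\frac{3\hbar^2}{4ms^3} + \frac{e^2}{4\pi\varepsilon_0 s^2} = 0,$$

so

$$
\begin{aligned}
s &= \frac{3\pi\varepsilon_0\hbar^2}{me^2} \\
&= \frac{3\pi\times 8.85\times 10^{-12}\times(1.06\times 10^{-34})^2}{9.11\times 10^{-31}\times(1.60\times 10^{-19})^2}\ \text{m} \\
&= 4.0\times 10^{-11}\ \text{m}.
\end{aligned}
$$

Substituting into the expression for the energy, we
obtain

$$
\begin{aligned}
E_{\min} &= \frac{3\hbar^2}{8m}\,\frac{(me^2)^2}{(3\pi\varepsilon_0\hbar^2)^2} - \frac{e^2}{4\pi\varepsilon_0}\,\frac{me^2}{3\pi\varepsilon_0\hbar^2} \\
&= -\frac{2me^4}{3(4\pi\varepsilon_0\hbar)^2} \\
&= -\frac{2\times 9.11\times 10^{-31}\times(1.60\times 10^{-19})^4}{3(4\pi\times 8.85\times 10^{-12}\times 1.06\times 10^{-34})^2}\ \text{J} \\
&= -2.9\times 10^{-18}\ \text{J}.
\end{aligned}
$$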


These values of s and Emin are our estimates for the 
radius and ground-state energy of a hydrogen atom. The 
ground-state energy is negative because the potential 
energy is zero when the particles are infinitely far apart. 
A bound hydrogen atom therefore has negative potential 
energy. 


Comment: Our estimates turn out to be almost 
correct. Full solutions of Schrédinger’s equation show 
that s is too small by a factor of 3/4, while | Emin| is too 
large by a factor of 4/3. Such errors are only to be 
expected, given the roughness of our approximations, 
but we have still achieved something remarkable. Using 
only a classical expression for the total energy of 

an electron interacting with a proton, together with 

the uncertainty principle, we have obtained good 
order-of-magnitude estimates of the ground-state 
energy and radius of a hydrogen atom! 
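
The arithmetic in this estimate is easy to reproduce. The following is a minimal sketch using the rounded constants quoted in the solution; the variable and function names (`energy_bound`, `s_min`, `e_min`) are chosen here for illustration only.

```python
import math

HBAR = 1.06e-34      # reduced Planck constant, J s
EPS0 = 8.85e-12      # permittivity of free space, F/m
M_E = 9.11e-31       # electron mass, kg
E_CHARGE = 1.60e-19  # magnitude of the electron charge, C


def energy_bound(s):
    """Kinetic lower bound 3*hbar^2/(8 m s^2) plus Coulomb potential energy at separation s."""
    kinetic = 3 * HBAR ** 2 / (8 * M_E * s ** 2)
    potential = -E_CHARGE ** 2 / (4 * math.pi * EPS0 * s)
    return kinetic + potential


# Separation that minimizes the bound, and the corresponding energy.
s_min = 3 * math.pi * EPS0 * HBAR ** 2 / (M_E * E_CHARGE ** 2)
e_min = energy_bound(s_min)
print(s_min, e_min)
```

Evaluating `energy_bound` slightly either side of `s_min` gives a larger value, which is a quick way to confirm that the stationary point is a minimum.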


Ex 5.1 The particle comes instantaneously to rest at
its points of maximum displacement, where $x = \pm A$.
At these points, the kinetic energy is zero, and the
potential energy is $\tfrac{1}{2}CA^2$, so the total energy is

$$E = 0 + \tfrac{1}{2}CA^2 = \tfrac{1}{2}CA^2.$$

This remains constant throughout the oscillation, by the
conservation of energy.

Comment: Figure S5.1 is a graphical interpretation of
this result. This figure compares the potential energy
function $\tfrac{1}{2}Cx^2$ with the total energy $E$. The kinetic
energy is non-negative, so the potential energy is always
less than, or equal to, the total energy $E$. The particle is
therefore confined to the region where $\tfrac{1}{2}Cx^2 \leq E$.
This inequality becomes an equality at the points of
maximum displacement, $x = \pm A$, where $E = \tfrac{1}{2}CA^2$.

Figure S5.1 An oscillating particle is confined to the
region $-A \leq x \leq A$, where the potential energy is less
than or equal to the total energy $E$.


Ex 5.2 Differentiating Equation 5.4 with respect to
time,

$$v_x = \frac{dx}{dt} = -\omega_0 A\sin(\omega_0 t + \phi),$$

so

$$p_x = -m\omega_0 A\sin(\omega_0 t + \phi).$$

Substituting into Equation 5.6, and using the
relationship $\omega_0 = \sqrt{C/m}$, the total energy is

$$
\begin{aligned}
E &= \tfrac{1}{2}m\omega_0^2A^2\sin^2(\omega_0 t + \phi) + \tfrac{1}{2}CA^2\cos^2(\omega_0 t + \phi) \\
&= \tfrac{1}{2}CA^2\left(\sin^2(\omega_0 t + \phi) + \cos^2(\omega_0 t + \phi)\right) \\
&= \tfrac{1}{2}CA^2,
\end{aligned}
$$

which is independent of time, as required (and agrees
with the result of the preceding exercise).


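Ex 5.3 Using Equations 5.3 and 5.5, and recalling that
the appropriate mass is the reduced mass, gives

$$\omega_0 = 2\pi f = \sqrt{\frac{C}{\mu}},$$

so

$$C = 4\pi^2 f^2\mu.$$

Let the mass of a hydrogen atom be $m$. Then the
reduced mass of a hydrogen molecule is

$$\mu = \frac{m^2}{2m} = \frac{m}{2},$$

and we have

$$
\begin{aligned}
C &= 2\pi^2 f^2 m \\
&= 2\pi^2\,(1.25\times 10^{14}\ \text{Hz})^2\times 1.67\times 10^{-27}\ \text{kg} \\
&= 520\ \text{N m}^{-1}.
\end{aligned}
$$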


This is comparable to the force constant of a stiff rubber 
band. 
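
As a quick numerical check of the force constant, here is a minimal sketch using the values quoted above (vibration frequency $1.25\times 10^{14}$ Hz and hydrogen-atom mass $1.67\times 10^{-27}$ kg); the variable names are illustrative.

```python
import math

f = 1.25e14        # vibrational frequency of the hydrogen molecule, Hz
m_h = 1.67e-27     # mass of one hydrogen atom, kg
mu = m_h / 2       # reduced mass of two equal masses

# C = 4 pi^2 f^2 mu, which equals 2 pi^2 f^2 m for equal masses.
force_constant = 4 * math.pi ** 2 * f ** 2 * mu
print(force_constant)  # in N/m; rounds to 520 at two significant figures
```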


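Ex 5.4 Substituting $\Psi(x,t) = \psi(x)\,e^{-iEt/\hbar}$ into the
two sides of Schrödinger’s equation gives

$$
\begin{aligned}
i\hbar\frac{\partial\Psi}{\partial t} &= i\hbar\left(-\frac{iE}{\hbar}\right)\psi(x)\,e^{-iEt/\hbar} \\
&= E\,\psi(x)\,e^{-iEt/\hbar}
\end{aligned}
$$

and

$$\hat{H}\Psi = \left[-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + \tfrac{1}{2}m\omega_0^2x^2\,\psi(x)\right]e^{-iEt/\hbar}.$$

Equating these expressions and cancelling the
exponential factor on both sides, we see that $\psi(x)$
satisfies the time-independent Schrödinger equation for
a harmonic oscillator (Equation 5.15).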


Ex 5.5 The normalization condition gives

$$
\begin{aligned}
1 &= \int_{-\infty}^{\infty} |\psi_0(x)|^2\,dx \\
&= \int_{-\infty}^{\infty} \left|C_0\,e^{-x^2/2a^2}\right|^2 dx \\
&= |C_0|^2\int_{-\infty}^{\infty} e^{-x^2/a^2}\,dx.
\end{aligned}
$$

Changing the variable of integration from $x$ to $y = x/a$,
and noting that $dx = a\,dy$, we obtain

$$1 = |C_0|^2\,a\int_{-\infty}^{\infty} e^{-y^2}\,dy.$$

According to the list given inside the back cover, the
remaining integral is equal to $\sqrt{\pi}$, so we have

$$|C_0|^2 = \frac{1}{\sqrt{\pi}\,a}.$$

The result quoted in the question is obtained by taking
the square root of both sides of this equation, choosing
$C_0$ to be real and positive as usual.


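Ex 5.6 (a) The dimensions of $\hbar$ are those of
energy × time = $[\text{M}][\text{L}]^2[\text{T}]^{-1}$, so the dimensions of
$a = \sqrt{\hbar/m\omega_0}$ are

$$\sqrt{\frac{[\text{M}][\text{L}]^2[\text{T}]^{-1}}{[\text{M}][\text{T}]^{-1}}} = [\text{L}].$$

(b) Exercise 5.1 showed that a classical harmonic
oscillator with amplitude $A$ has energy $E = \tfrac{1}{2}CA^2$. So,
at energy $E_0 = \hbar\omega_0/2$, the classical amplitude is

$$A = \sqrt{\frac{2E_0}{C}} = \sqrt{\frac{\hbar\omega_0}{C}}.$$

Using the relationship $\omega_0 = \sqrt{C/m}$ to eliminate $C$, we
obtain

$$A = \sqrt{\frac{\hbar\omega_0}{m\omega_0^2}} = \sqrt{\frac{\hbar}{m\omega_0}} = a,$$

which is the length parameter of the oscillator.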


Ex 5.7 Probabilities do not depend on time for
stationary states, so we can carry out the calculation
at time $t = 0$, when the stationary-state wave
function is equal to the energy eigenfunction $\psi_0(x)$.
The probability $P_{\text{out}}$ of finding the particle in the
classically forbidden region is the total area under the
graph of $|\psi_0(x)|^2$ in the regions $-\infty < x < -a$ and
$a < x < \infty$. Thus

$$P_{\text{out}} = \int_{-\infty}^{-a} |\psi_0(x)|^2\,dx + \int_{a}^{\infty} |\psi_0(x)|^2\,dx.$$

The eigenfunction $\psi_0(x)$ is an even function, so we
have

$$P_{\text{out}} = 2\int_{a}^{\infty} |\psi_0(x)|^2\,dx = \frac{2}{\sqrt{\pi}\,a}\int_{a}^{\infty} e^{-x^2/a^2}\,dx.$$

Changing the variable of integration to $y = x/a$, as in
Exercise 5.5, and using the definite integral given in the
question, we conclude that

$$P_{\text{out}} = \frac{2}{\sqrt{\pi}}\int_{1}^{\infty} e^{-y^2}\,dy = \frac{2}{\sqrt{\pi}}\times 0.139 = 0.157.$$

So there is a reasonable chance (nearly 1 in 6) of finding
the particle in the classically forbidden region $|x| > a$.
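
The quantity $(2/\sqrt{\pi})\int_1^\infty e^{-y^2}\,dy$ is the complementary error function evaluated at 1, so the result can be checked with Python's standard library. This is a minimal sketch; the variable names are illustrative.

```python
import math

# Probability of finding the ground-state oscillator beyond |x| = a:
# (2/sqrt(pi)) * integral from 1 to infinity of exp(-y^2) dy = erfc(1).
p_out = math.erfc(1.0)

# The bare integral quoted in the question (about 0.139) follows by rescaling.
tail_integral = p_out * math.sqrt(math.pi) / 2
print(p_out, tail_integral)
```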


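Ex 5.8 Acting on $\hat{A}\,\psi_n(x)$ with the Hamiltonian
operator gives

$$\hat{H}\left[\hat{A}\,\psi_n(x)\right] = \left(\hat{A}^\dagger\hat{A} + \tfrac{1}{2}\right)\hbar\omega_0\left[\hat{A}\,\psi_n(x)\right].$$

We make use of the commutation relation
$\hat{A}\hat{A}^\dagger - \hat{A}^\dagger\hat{A} = 1$ (Equation 5.32) to write

$$\hat{A}^\dagger\hat{A} = \hat{A}\hat{A}^\dagger - 1$$

and hence

$$\hat{H}\left[\hat{A}\,\psi_n(x)\right] = \left(\hat{A}\hat{A}^\dagger - 1 + \tfrac{1}{2}\right)\hbar\omega_0\left[\hat{A}\,\psi_n(x)\right].$$

Then, pulling out a factor $\hat{A}$ from terms on the
right-hand side, we obtain

$$
\begin{aligned}
\hat{H}\left[\hat{A}\,\psi_n(x)\right] &= \hat{A}\left(\hat{A}^\dagger\hat{A} - 1 + \tfrac{1}{2}\right)\hbar\omega_0\,\psi_n(x) \\
&= \hat{A}\left(\hat{H} - \hbar\omega_0\right)\psi_n(x) \\
&= \hat{A}\left(E_n - \hbar\omega_0\right)\psi_n(x) \\
&= \left(E_n - \hbar\omega_0\right)\left[\hat{A}\,\psi_n(x)\right],
\end{aligned}
$$

the required result.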


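Ex 5.9 We use Equation 5.45 to express the required
expectation value in terms of raising and lowering
operators:

$$\langle p_x^2\rangle = -\frac{\hbar^2}{2a^2}\int_{-\infty}^{\infty} \psi_n^*(x)\,(\hat{A} - \hat{A}^\dagger)(\hat{A} - \hat{A}^\dagger)\,\psi_n(x)\,dx.$$

Following the same approach as in Worked
Example 5.2, we obtain

$$\langle p_x^2\rangle = \frac{\hbar^2}{2a^2}\int_{-\infty}^{\infty} \psi_n^*(x)\,(\hat{A}^\dagger\hat{A} + \hat{A}\hat{A}^\dagger)\,\psi_n(x)\,dx.$$

The integral is exactly the same as that evaluated in the
worked example, and has value $(2n + 1)$, so we
conclude that

$$\langle p_x^2\rangle = (2n + 1)\frac{\hbar^2}{2a^2} = \left(n + \tfrac{1}{2}\right)\frac{\hbar^2}{a^2},$$

as required.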


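Ex 5.10 Let $I$ be the integral on the right-hand side of
Equation 5.53. Then, in terms of raising and lowering
operators, we have

$$
\begin{aligned}
I &= \frac{a}{\sqrt{2}}\int_{-\infty}^{\infty} \psi_m^*(x)\,(\hat{A} + \hat{A}^\dagger)\,\psi_n(x)\,dx \\
&= \frac{a}{\sqrt{2}}\left[\alpha\int_{-\infty}^{\infty} \psi_m^*(x)\,\psi_{n-1}(x)\,dx + \beta\int_{-\infty}^{\infty} \psi_m^*(x)\,\psi_{n+1}(x)\,dx\right],
\end{aligned}
$$

where $\alpha$ and $\beta$ are constants (whose values are not
needed). Using the fact that eigenfunctions with
different eigenvalues are orthogonal, we see that
this integral is equal to zero unless $n = m - 1$ or
$n = m + 1$, corresponding to transitions between
neighbouring energy levels.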


Ex 6.1 Integrating both sides of Equation 6.4, we
obtain

$$
\begin{aligned}
\int_{-\infty}^{\infty} \Psi^*(x,t)\,\Psi(x,t)\,dx
&= |a_1|^2\int_{-\infty}^{\infty} |\psi_1(x)|^2\,dx + |a_2|^2\int_{-\infty}^{\infty} |\psi_2(x)|^2\,dx \\
&\quad + a_1^*a_2\,e^{-i(E_2 - E_1)t/\hbar}\int_{-\infty}^{\infty} \psi_1^*(x)\,\psi_2(x)\,dx \\
&\quad + a_2^*a_1\,e^{i(E_2 - E_1)t/\hbar}\int_{-\infty}^{\infty} \psi_2^*(x)\,\psi_1(x)\,dx.
\end{aligned}
$$

The integrals in the first two terms are both equal to 1
because the energy eigenfunctions are normalized. The
integrals in the last two terms are both equal to zero
because different energy eigenfunctions are orthogonal.
Using Equation 6.3, we therefore conclude that

$$\int_{-\infty}^{\infty} \Psi^*(x,t)\,\Psi(x,t)\,dx = |a_1|^2 + |a_2|^2 = 1,$$

as required.

Comment: The fact that the linear combination of
stationary-state wave functions is normalized entitles us
to call it a wave packet. Note that the wave packet
remains normalized at all times because the coefficients
$a_1$ and $a_2$ are constants.


Ex 6.2 At time $t = T = 2\pi/\omega_0$, the phase factors
in Equation 6.7 are $e^{-i\omega_0T/2} = e^{-i\pi} = -1$ and
$e^{-3i\omega_0T/2} = e^{-3i\pi} = -1$. Hence

$$\Psi_A(x,T) = -\frac{1}{\sqrt{2}}\left[\psi_0(x) + \psi_1(x)\right] = -\Psi_A(x,0).$$

(This reversal in sign has no physical consequences; it
is an example of multiplying a whole wave function by
a phase factor, a process which never has any physical
consequences.)

For $t = 2T$, the phase factors are $e^{-2i\pi} = 1$ and
$e^{-6i\pi} = 1$, so $\Psi_A(x,2T) = \Psi_A(x,0)$ and the wave
function returns to its initial value.


Ex 6.3 Using Equations 6.8 and 6.9, we find that

$$\Psi_A(x,0) = \frac{1}{\sqrt{2}}\left(\frac{1}{\sqrt{\pi}\,a}\right)^{1/2}\left[1 + \frac{\sqrt{2}\,x}{a}\right]e^{-x^2/2a^2}$$

and

$$\Psi_A(x,T/2) = \frac{-i}{\sqrt{2}}\left(\frac{1}{\sqrt{\pi}\,a}\right)^{1/2}\left[1 - \frac{\sqrt{2}\,x}{a}\right]e^{-x^2/2a^2}.$$

In $\Psi_A(x,0)$, the two terms in square brackets have the
same sign for $x > 0$ (constructive interference) and
opposite signs for $x < 0$ (destructive interference), so
the probability density $|\Psi_A|^2$ is concentrated on the
right-hand side of the well. In $\Psi_A(x,T/2)$, the two
terms in square brackets have opposite signs for $x > 0$
(destructive interference) and the same sign for $x < 0$
(constructive interference), so the probability density
$|\Psi_A|^2$ is concentrated on the left-hand side of the well.
Also note that our explicit expressions show that
$\Psi_A(x,T/2) = -i\,\Psi_A(-x,0)$, so the probability density at
$T/2$ is the reflection of the probability density at $t = 0$,
in agreement with Figure 6.1.


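Ex 6.4 We have

$$\langle x\rangle = \frac{a}{\sqrt{2}}\cos(\omega_0 t) \quad\text{and}\quad \langle p_x\rangle = -\frac{\hbar}{\sqrt{2}\,a}\sin(\omega_0 t),$$

so

$$m\frac{d\langle x\rangle}{dt} = -\frac{ma\omega_0}{\sqrt{2}}\sin(\omega_0 t) = \frac{ma^2\omega_0}{\hbar}\langle p_x\rangle.$$

Since $a = \sqrt{\hbar/m\omega_0}$, we conclude that
$m\,d\langle x\rangle/dt = \langle p_x\rangle$.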


Ex 6.5 The energy eigenfunctions $\psi_1(x)$ and $\psi_3(x)$
are both odd functions of $x$, so $\Psi(x,t)$ and $\Psi^*(x,t)$ are
also odd functions, with $\Psi(-x,t) = -\Psi(x,t)$ and
$\Psi^*(-x,t) = -\Psi^*(x,t)$. Hence

$$
\begin{aligned}
|\Psi(-x,t)|^2 &= \Psi^*(-x,t)\,\Psi(-x,t) \\
&= (-1)^2\,\Psi^*(x,t)\,\Psi(x,t) \\
&= |\Psi(x,t)|^2.
\end{aligned}
$$

So the probability density is an even function of $x$, and
is therefore symmetrical about the centre of the well at
all times.

The expectation value of position is given by the
sandwich integral

$$\langle x\rangle = \int_{-\infty}^{\infty} \Psi^*(x,t)\,x\,\Psi(x,t)\,dx,$$

which is equal to zero because the integrand is an odd
function (being the product of three odd functions) and
the range of integration is symmetrical about the origin.
(Alternatively, we could note that the operator $\hat{A} + \hat{A}^\dagger$,
acting on a linear combination of $\psi_1(x)$ and $\psi_3(x)$,
produces a linear combination of $\psi_0(x)$, $\psi_2(x)$ and
$\psi_4(x)$, which is orthogonal to the original function.
This again implies that $\langle x\rangle = 0$.)

The wave packet does not oscillate to and fro across the
well, but breathes in and out with an uncertainty in
position that increases and decreases cyclically.


Ex 6.6 We have $\partial V/\partial x = \partial(mgx)/\partial x = mg$.
Combining E1 and E2 then gives

$$\frac{d^2\langle x\rangle}{dt^2} = \frac{1}{m}\frac{d\langle p_x\rangle}{dt} = -\frac{1}{m}\left\langle\frac{\partial V}{\partial x}\right\rangle = -g.$$

The general solution of this differential equation is

$$\langle x\rangle = A + Bt - \tfrac{1}{2}gt^2,$$

where $A$ and $B$ are constants that depend on the initial
conditions, i.e. the values of $\langle x\rangle$ and $d\langle x\rangle/dt$ at time
$t = 0$. This solution remains valid so long as the
potential energy function is $V(x) = mgx$. In practice,
this ceases to be so when the wave packet describing the
state of the particle extends as far as the ground, since
forces other than gravity then come into play.


Ex 6.7 The initial wave function $\Psi(x,0)$ is
symmetric, so it is an even function of $x$. The energy
eigenfunctions in the harmonic well are either even or
odd functions of $x$: $\psi_0(x), \psi_2(x), \ldots$ are even while
$\psi_1(x), \psi_3(x), \ldots$ are odd. If we now consider the
overlap integral of Equation 6.30,

$$a_i = \int_{-\infty}^{\infty} \psi_i^*(x)\,\Psi(x,0)\,dx,$$

we see that when $i = 1, 3, 5, \ldots$, the integrand is the
product of an odd function $\psi_i^*(x)$ and an even function
$\Psi(x,0)$, and so is an odd function. The range of
integration is symmetrical about the origin, so the
integral vanishes in this case, and we conclude that
$a_i = 0$ for $i = 1, 3, 5, \ldots$. From Equation 6.26, the
wave function at future times is therefore given by

$$\Psi(x,t) = a_0\psi_0(x)\,e^{-iE_0t/\hbar} + a_2\psi_2(x)\,e^{-iE_2t/\hbar} + \cdots.$$

This only involves the even functions $\psi_0(x), \psi_2(x), \ldots$,
so $\Psi(x,t)$ is itself an even function, with
$\Psi(-x,t) = \Psi(x,t)$ for all $t$. Hence, the wave function
remains symmetric at all times.


Ex 6.8 The expectation value of momentum is given
by the sandwich integral

$$\langle p_x\rangle = \int_{-\infty}^{\infty} \Psi^*(x,0)\,\hat{p}_x\,\Psi(x,0)\,dx,$$

where $\Psi(x,0) = a_2\psi_2(x) + a_5\psi_5(x)$. The momentum
operator $\hat{p}_x$ can be expressed as a linear combination of
lowering and raising operators. When these operators
act on the harmonic oscillator eigenfunctions $\psi_2(x)$ and
$\psi_5(x)$, they produce terms proportional to $\psi_1(x)$ and
$\psi_3(x)$ in the first case, and $\psi_4(x)$ and $\psi_6(x)$ in the
second case. Each of these is orthogonal to the original
eigenfunctions $\psi_2(x)$ and $\psi_5(x)$. Using these results,
the above sandwich integral produces a sum of terms
each of which is equal to zero (by orthogonality).
Hence $\langle p_x\rangle = 0$.

Comment: This answer is complete. If you would
prefer a more explicit derivation, you could supply one
using the above sandwich integral and Equations 6.12,
6.13, 6.15 and 6.29. Symmetry arguments are
ineffective in this case because the wave function
$\Psi(x,0)$ is neither even nor odd.


Ex 6.9 Since $E_k = \hbar^2k^2/2m$, the right-hand sides of
the two equations given in the question are equal.
Hence,

$$-\frac{\hbar^2}{2m}\frac{\partial^2\Psi}{\partial x^2} = i\hbar\frac{\partial\Psi}{\partial t},$$

so $\Psi(x,t)$ is a solution of the free-particle Schrödinger
equation.


Ex 6.10 Born’s rule for momentum tells us that the
probability of finding the momentum in a small interval
of momentum $\hbar\,\delta k$, centred on $\hbar k$, is $|A(k)|^2\,\delta k$.

Consequently, the expectation value of the momentum
is

$$\langle p_x\rangle = \int_{-\infty}^{\infty} \hbar k\,|A(k)|^2\,dk,$$

and the expectation value of the kinetic energy is

$$\langle E_{\text{kin}}\rangle = \int_{-\infty}^{\infty} \frac{\hbar^2k^2}{2m}\,|A(k)|^2\,dk.$$

In a free-particle wave packet, the momentum
amplitude function $A(k)$ is a time-independent
coefficient used in a linear superposition of de Broglie
wave functions. In more physical terms, the momentum
distribution remains fixed because there are no forces
acting to change it. It follows that $\langle p_x\rangle$ and $\langle E_{\text{kin}}\rangle$ are
independent of time for any free-particle wave packet.


Ex 6.11 Including the factor $1/\sqrt{2\pi}$ in the
momentum eigenfunction simplifies several other
equations. It allows Born’s rule for momentum to be
expressed in terms of $|A(k)|^2\,\delta k$ and it allows the
momentum amplitude $A(k)$ to be expressed as a simple
overlap integral $\int_{-\infty}^{\infty} \psi_k^*(x)\,\Psi(x,0)\,dx$.


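Ex 6.12 Since $\Delta x = a/\sqrt{2}$, we find that
$a = \sqrt{2}\times 10^{-3}$ m at $t = 0$. We wish to know $\Delta x$ at
$t = 5$ hours $= 1.8\times 10^{4}$ s. Using Equation 6.42, and
taking values of the required constants from inside the
back cover, we get:

$$\Delta x \simeq \frac{1.06\times 10^{-34}\ \text{J s}\times 1.8\times 10^{4}\ \text{s}}{\sqrt{2}\times\sqrt{2}\times 10^{-3}\ \text{m}\times 9.11\times 10^{-31}\ \text{kg}} = 1.05\times 10^{3}\ \text{m}.$$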

Comment: This wave packet would take about 

10 minutes to stroll across! This is remarkable, though 
it does assume that the electron is isolated; even in 
space, a real electron would encounter atoms that would 
make the wave packet collapse long before it reached 
that size. This ‘collapse’ is what happens when a 
detecting screen makes an electron ‘choose’ the place 
where it appears. Remember that the size of the wave 
packet is not the size of an electron, only a measure of 
the size of the region in which it might be found in a 
position measurement. 
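
The size of the spread-out packet can be reproduced in a few lines. This sketch assumes that, at times long after the packet has started to spread, Equation 6.42 reduces to $\Delta x \simeq \hbar t/(\sqrt{2}\,a\,m)$, which is the form implied by the numbers substituted above; the variable names are illustrative.

```python
import math

HBAR = 1.06e-34              # reduced Planck constant, J s
M_E = 9.11e-31               # electron mass, kg
a = math.sqrt(2) * 1e-3      # initial width parameter, m (so delta_x = 1 mm at t = 0)
t = 5 * 3600.0               # elapsed time: 5 hours in seconds

# Long-time width of a free Gaussian wave packet (assumed limiting form).
delta_x = HBAR * t / (math.sqrt(2) * a * M_E)
print(delta_x)  # in metres
```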


Ex 7.1 The two bumps in the probability density have 
roughly equal areas. This indicates that the probability 
of transmission is roughly equal to the probability of 
reflection. Since the sum of these two probabilities must 
be equal to 1, the probability of each outcome is about 
0.5. 




Ex 7.2 In view of the quantum behaviour of individual 
particles (as represented by wave packets) when they 
meet a finite square barrier, it is reasonable to expect 
that there is some chance that the particles encountering 
a finite square step will be reflected. In the case of 
quantum scattering we should therefore expect the 
outcome to include a reflected beam as well as a 
transmitted beam, even though Eo > Vo. 


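Ex 7.3 (a) From Equations 7.21 and 7.22,

$$
\begin{aligned}
R + T &= \frac{(k_1 - k_2)^2 + 4k_1k_2}{(k_1 + k_2)^2} \\
&= \frac{k_1^2 + k_2^2 + 2k_1k_2}{(k_1 + k_2)^2} \\
&= 1.
\end{aligned}
$$

(b) From Equation 7.13, we have

$$k_1 = \frac{\sqrt{4mV_0}}{\hbar} \quad\text{and}\quad k_2 = \frac{\sqrt{2mV_0}}{\hbar}.$$

So, in this case, $k_1 = \sqrt{2}\,k_2$. Therefore

$$R = \frac{(k_1 - k_2)^2}{(k_1 + k_2)^2} = \frac{(\sqrt{2} - 1)^2}{(\sqrt{2} + 1)^2} = 0.03,$$

$$T = \frac{4k_1k_2}{(k_1 + k_2)^2} = \frac{4\sqrt{2}}{(\sqrt{2} + 1)^2} = 0.97.$$

So we can check that $R + T = 1$ in this case.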
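
The reflection and transmission coefficients for a step are simple to evaluate for any pair of wave numbers. The sketch below uses the expressions for $R$ and $T$ written out above; the function name `step_coefficients` is hypothetical, and only the ratio $k_1/k_2 = \sqrt{2}$ matters for part (b).

```python
import math


def step_coefficients(k1, k2):
    """Reflection and transmission coefficients for a finite square step (E0 > V0)."""
    reflection = ((k1 - k2) / (k1 + k2)) ** 2
    transmission = 4 * k1 * k2 / (k1 + k2) ** 2
    return reflection, transmission


# Part (b): k1 = sqrt(2) * k2, so take k2 = 1 in arbitrary units.
R, T = step_coefficients(math.sqrt(2), 1.0)
print(R, T, R + T)
```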


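Ex 7.4 (a) When $k_2 = k_1/2$, we have

$$B = \frac{k_1 - k_2}{k_1 + k_2}\,A = \frac{A}{3} \quad\text{and}\quad C = \frac{2k_1}{k_1 + k_2}\,A = \frac{4A}{3}.$$

(b) From Equation 7.15 with $D = 0$,

$$|\Psi|^2 = \left|C\,e^{i(k_2x - \omega t)}\right|^2 = |C|^2 = \frac{16}{9}|A|^2.$$

Similarly, from Equation 7.14,

$$
\begin{aligned}
|\Psi|^2 &= \left|A\,e^{i(k_1x - \omega t)} + B\,e^{-i(k_1x + \omega t)}\right|^2 \\
&= \left|e^{-i\omega t}\left(A\,e^{ik_1x} + B\,e^{-ik_1x}\right)\right|^2 \\
&= \left|e^{-i\omega t}\right|^2\times|A|^2\left|e^{ik_1x} + \tfrac{1}{3}e^{-ik_1x}\right|^2.
\end{aligned}
$$

Since $|e^{-i\omega t}| = 1$, we have

$$
\begin{aligned}
|\Psi|^2 &= |A|^2\left(e^{ik_1x} + \tfrac{1}{3}e^{-ik_1x}\right)^*\left(e^{ik_1x} + \tfrac{1}{3}e^{-ik_1x}\right) \\
&= |A|^2\left(e^{-ik_1x} + \tfrac{1}{3}e^{ik_1x}\right)\left(e^{ik_1x} + \tfrac{1}{3}e^{-ik_1x}\right).
\end{aligned}
$$

Multiplying out the brackets, we find

$$
\begin{aligned}
|\Psi|^2 &= |A|^2\left[1 + \tfrac{1}{9} + \tfrac{1}{3}\left(e^{2ik_1x} + e^{-2ik_1x}\right)\right] \\
&= |A|^2\left[\tfrac{10}{9} + \tfrac{2}{3}\cos(2k_1x)\right].
\end{aligned}
$$

(c) The variation indicated by the cosine-dependence to
the left of the step is a result of interference between
the incident and reflected beams. The presence of
interference effects was noted earlier when we were
discussing the scattering of wave packets but there the
effect was transitory. In the stationary-state approach
interference is a permanent feature.

(d) The linear number densities in the incident,
reflected and transmitted beams are given by
$|A|^2$, $|B|^2$ and $|C|^2$. The question tells us that
$|A|^2 = 1.00\times 10^{24}$ m$^{-1}$, so the linear number densities
in the reflected and transmitted beams are

$$|B|^2 = \frac{|A|^2}{9} = 1.11\times 10^{23}\ \text{m}^{-1},$$

$$|C|^2 = \frac{16|A|^2}{9} = 1.78\times 10^{24}\ \text{m}^{-1}.$$

Note that the transmitted beam is denser than the
incident beam: $|C|^2 > |A|^2$. However, since $k_2 = k_1/2$,
we have $j_{\text{trans}} < j_{\text{inc}}$. The transmitted beam is less
intense than the incident beam because it travels much
more slowly.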


Ex 7.5 A suitable graph is shown in Figure $7.1. 


Figure S7.1 $|\Psi|^2$ plotted against $x$ for a finite square
step at $x = 0$ when $E_0 > V_0$.


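Ex 7.6 One way of showing that a quantity is real is to
show that it is equal to its own complex conjugate.
Taking the complex conjugate of each factor in $j_x$, we
obtain

$$
\begin{aligned}
j_x^* &= +\frac{i\hbar}{2m}\left(\Psi\frac{\partial\Psi^*}{\partial x} - \Psi^*\frac{\partial\Psi}{\partial x}\right) \\
&= -\frac{i\hbar}{2m}\left(\Psi^*\frac{\partial\Psi}{\partial x} - \Psi\frac{\partial\Psi^*}{\partial x}\right) = j_x(x,t).
\end{aligned}
$$

Since $j_x^* = j_x$, we conclude that $j_x$ is real.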


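Ex 7.7 In the region $x > 0$,

$$\psi(x) = C\,e^{ik_2x} \quad\text{and}\quad \psi^*(x) = C^*e^{-ik_2x},$$

so Equation 7.29 gives the probability current

$$
\begin{aligned}
j_x &= -\frac{i\hbar}{2m}\left[C^*e^{-ik_2x}\left(ik_2C\,e^{ik_2x}\right) - C\,e^{ik_2x}\left(-ik_2C^*e^{-ik_2x}\right)\right] \\
&= \frac{\hbar k_2}{m}|C|^2.
\end{aligned}
$$

In the region $x < 0$,

$$\psi(x) = A\,e^{ik_1x} + B\,e^{-ik_1x}$$

and

$$\psi^*(x) = A^*e^{-ik_1x} + B^*e^{ik_1x},$$

so Equation 7.29 gives the probability current

$$
\begin{aligned}
j_x = -\frac{i\hbar}{2m}\Big[&\left(A^*e^{-ik_1x} + B^*e^{ik_1x}\right)\times(ik_1)\left(A\,e^{ik_1x} - B\,e^{-ik_1x}\right) \\
&- \left(A\,e^{ik_1x} + B\,e^{-ik_1x}\right)\times(-ik_1)\left(A^*e^{-ik_1x} - B^*e^{ik_1x}\right)\Big].
\end{aligned}
$$

Simplifying this expression, we obtain

$$j_x = \frac{\hbar k_1}{m}\left[|A|^2 - |B|^2\right].$$

This can be interpreted as the sum of an incident
probability current, $\hbar k_1|A|^2/m$, and a reflected
probability current, $-\hbar k_1|B|^2/m$. These two
contributions have opposite signs because they flow in
opposite directions. Note that, in each region, the
probability currents are consistent with the incident,
reflected and transmitted beam intensities assumed
earlier (and now justified).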


Ex 7.8 The main difference is that in the case of a
finite square barrier $V_0$ must be replaced by $-V_0$
throughout the analysis. The wave number in Region 2
is

$$k_2 = \frac{\sqrt{2m(E_0 - V_0)}}{\hbar}.$$

Adapting Equation 7.44, the transmission coefficient is
then given by

$$T = \frac{4E_0(E_0 - V_0)}{4E_0(E_0 - V_0) + V_0^2\sin^2(k_2L)}.$$

A typical graph of the real part of $\psi(x)$ for scattering
by a finite square barrier is shown in Figure S7.2.
For a barrier, the wavelength in Region 2 is longer,
corresponding to a smaller wave number and a smaller
momentum. The amplitude of the wave is generally
greater in Region 2 than outside it (unless $T = 1$).

Figure S7.2 A typical graph of the real part Re($\psi$) of
$\psi(x)$ for a finite square barrier.


Ex 7.9 The graphs of $|\Psi(x,t)|^2$ plotted against $x$ for
each case are shown in Figure S7.3.

Figure S7.3 $|\Psi(x,t)|^2$ plotted against $x$ for
(a) a finite square well and (b) a finite square barrier.

In Regions 1 and 2 interference effects lead to periodic
spatial variations with a period related to the relevant
wave number. In Region 3 there is only one beam, so
there are no interference effects. Note that there is no
simple relationship between the plots of $|\Psi(x,t)|^2$ and
the corresponding plots of the real part of $\Psi(x,t)$. This
is because $|\Psi(x,t)|^2$ is partly determined by the
imaginary part of $\Psi(x,t)$. In both cases $|\Psi(x,t)|^2$ is
independent of time. This must be the case, despite the
time-dependence of $\Psi(x,t)$, because we are dealing
with stationary states.


Ex 7.10 From Equation 7.46

$$\psi(x) = A\,e^{ik_1x} + B\,e^{-ik_1x}$$

and

$$\psi^*(x) = A^*e^{-ik_1x} + B^*e^{ik_1x},$$

so

$$
\begin{aligned}
|\psi(x)|^2 &= \left(A^*e^{-ik_1x} + B^*e^{ik_1x}\right)\left(A\,e^{ik_1x} + B\,e^{-ik_1x}\right) \\
&= A^*A + B^*B + A^*B\,e^{-2ik_1x} + B^*A\,e^{2ik_1x}.
\end{aligned}
$$

This is a positive periodic function that runs through the
same range of values each time $x$ increases by $2\pi/2k_1$,
so its minima are separated by $\pi/k_1$.


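Ex 7.11 Using Equation 7.47, with $D = 0$ and $\alpha$ real,
and recalling the definition of probability current given
in Equation 7.28, we have

$$
\begin{aligned}
j_x(x,t) &= -\frac{i\hbar}{2m}\left[C^*e^{-\alpha x}\frac{\partial}{\partial x}\left(C\,e^{-\alpha x}\right) - C\,e^{-\alpha x}\frac{\partial}{\partial x}\left(C^*e^{-\alpha x}\right)\right] \\
&= -\frac{i\hbar}{2m}\left[C^*C(-\alpha)\,e^{-2\alpha x} - C\,C^*(-\alpha)\,e^{-2\alpha x}\right] \\
&= 0.
\end{aligned}
$$
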
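Ex 7.12 From Equation 7.29,

$$j_x(x) = -\frac{i\hbar}{2m}\left[\psi^*\frac{\partial\psi}{\partial x} - \psi\frac{\partial\psi^*}{\partial x}\right].$$

In Region 1, where $\psi(x) = A\,e^{ik_1x} + B\,e^{-ik_1x}$
and $\psi^*(x) = A^*e^{-ik_1x} + B^*e^{ik_1x}$, we have

$$
\begin{aligned}
j_x = -\frac{i\hbar}{2m}\Big[&\left(A^*e^{-ik_1x} + B^*e^{ik_1x}\right)\left(ik_1A\,e^{ik_1x} - ik_1B\,e^{-ik_1x}\right) \\
&- \left(A\,e^{ik_1x} + B\,e^{-ik_1x}\right)\left(-ik_1A^*e^{-ik_1x} + ik_1B^*e^{ik_1x}\right)\Big].
\end{aligned}
$$

Simplifying gives

$$j_x = \frac{\hbar k_1}{m}\left[A^*A - B^*B\right] \quad\text{for } x < 0,$$

which can be interpreted as the sum of the probability
density currents associated with the incident and
reflected beams.

Similarly, in Region 2, where $\psi(x) = C\,e^{-\alpha x} + D\,e^{\alpha x}$
and $\psi^*(x) = C^*e^{-\alpha x} + D^*e^{\alpha x}$, we have

$$
\begin{aligned}
j_x = -\frac{i\hbar}{2m}\Big[&\left(C^*e^{-\alpha x} + D^*e^{\alpha x}\right)\left(-\alpha C\,e^{-\alpha x} + \alpha D\,e^{\alpha x}\right) \\
&- \left(C\,e^{-\alpha x} + D\,e^{\alpha x}\right)\left(-\alpha C^*e^{-\alpha x} + \alpha D^*e^{\alpha x}\right)\Big],
\end{aligned}
$$

which simplifies to

$$j_x = -\frac{i\hbar\alpha}{m}\left[C^*D - CD^*\right] \quad\text{for } 0 \leq x \leq L.$$

Finally, in Region 3, where $\psi(x) = F\,e^{ik_1x}$,

$$j_x = \frac{\hbar k_1}{m}\left[F^*F\right] \quad\text{for } x > L.$$

The result for Region 2 is a real non-zero quantity, so
there is a probability current inside the barrier. This is
not really surprising since particles must pass through
the barrier to produce a transmitted beam in Region 3.
In fact, conservation of probability requires that the
probability current should be the same in all three
regions.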


Ex 8.1 For $z = 3 - 4i$, the real part is 3, the imaginary
part is $-4$ and the modulus is $|z| = \sqrt{3^2 + (-4)^2} = 5$.


Ex 8.2 Given $z_1 = 1 + 2i$ and $z_2 = 4 - 3i$, we have:

(a) $z_1 + z_2 = (1 + 2i) + (4 - 3i) = 5 - i$,

(b) $2z_1 - 3z_2 = 2(1 + 2i) - 3(4 - 3i) = -10 + 13i$,

(c) $z_1z_2 = (1 + 2i)(4 - 3i) = 4 + (2i)(-3i) + 8i - 3i = 10 + 5i$,

(d) $\dfrac{z_1}{z_2} = \dfrac{1 + 2i}{4 - 3i} = \dfrac{1 + 2i}{4 - 3i}\times\dfrac{4 + 3i}{4 + 3i} = \dfrac{4 + (2i)(3i) + 8i + 3i}{25} = \dfrac{-2 + 11i}{25}$.
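
These four results can be confirmed with Python's built-in complex type, in which the imaginary unit is written `j`. This is a minimal sketch; the variable names are illustrative.

```python
z1 = 1 + 2j
z2 = 4 - 3j

total = z1 + z2               # part (a)
combination = 2 * z1 - 3 * z2  # part (b)
product = z1 * z2             # part (c)
quotient = z1 / z2            # part (d)

print(total, combination, product, quotient)
```

Parts (a) to (c) involve only small integers and are reproduced exactly; the quotient in part (d) is a floating-point approximation to $(-2 + 11i)/25$.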


Ex 8.3 Taking the complex conjugate of $z$ gives

$$
\begin{aligned}
z^* &= \left[i(ab^* - a^*b)\right]^* \\
&= -i(a^*b - ab^*) \\
&= i(ab^* - a^*b) \\
&= z.
\end{aligned}
$$

We conclude that $z$ is real because $z^* = z$.


Ex 8.4 (a) For $z = 1 + i\sqrt{3}$, the radial coordinate is
$r = |z| = \sqrt{1 + 3} = 2$. The phase $\theta$ satisfies the
equation $\cos\theta = x/r = 1/2$. We are looking for a
solution in the first quadrant ($0 \leq \theta \leq \pi/2$), so the
appropriate choice is $\theta = \pi/3$, giving

$$z = 2\left[\cos(\pi/3) + i\sin(\pi/3)\right].$$

(b) For $z = -1 + i\sqrt{3}$, the radial coordinate is again
$r = |z| = \sqrt{1 + 3} = 2$. The phase $\theta$ satisfies the
equation $\cos\theta = x/r = -1/2$. We are looking for a
solution in the second quadrant ($\pi/2 \leq \theta \leq \pi$). Since
$\cos(\pi - \phi) = -\cos\phi$, the appropriate choice is
$\theta = \pi - \pi/3 = 2\pi/3$, giving

$$z = 2\left[\cos(2\pi/3) + i\sin(2\pi/3)\right].$$


Ex 8.5 Using Equation 8.5, we have

$$
\begin{aligned}
|z|^2 = zz^* &= r(\cos\theta + i\sin\theta)\times r(\cos\theta - i\sin\theta) \\
&= r^2(\cos^2\theta + \sin^2\theta) = r^2.
\end{aligned}
$$

Since both $|z|$ and $r$ are real and positive, we conclude
that $|z| = r$.


Ex 8.6 Euler’s formula, and its complex conjugate,
tell us that

$$e^{i\theta} = \cos\theta + i\sin\theta,$$

$$e^{-i\theta} = \cos\theta - i\sin\theta.$$

Adding these two equations and dividing by 2 gives
the required expression for $\cos\theta$, while subtracting
the equations and dividing by $2i$ gives the required
expression for $\sin\theta$.


Ex 8.7 We have 
$$e^3 e^{-i\pi/6} = e^3\left[\cos(\pi/6) - i\sin(\pi/6)\right] = \frac{\sqrt{3}\,e^3}{2} - i\,\frac{e^3}{2}.$$

Ex 8.8 The complex number $i = e^{i\pi/2}$ so if $z = e^{i\theta}$,
then
$$iz = e^{i\pi/2} e^{i\theta} = e^{i(\theta + \pi/2)},$$
$$\frac{z}{i} = \frac{e^{i\theta}}{e^{i\pi/2}} = e^{i(\theta - \pi/2)}.$$

Thus, multiplying by $i$ produces an anticlockwise
rotation of $\pi/2$ rad = 90°, while dividing by $i$ produces
an anticlockwise rotation of $-\pi/2$ rad = $-90°$ (that is,
a clockwise rotation of $\pi/2$ rad = 90°).


Solutions to exercises 


Ex 8.9 A lot of unnecessary effort can be avoided by 
converting z to exponential form. It is easy to see that 
$z = (1 + i)/\sqrt{2} = e^{i\pi/4}$. Hence

$$z^{100} = (e^{i\pi/4})^{100} = e^{i25\pi} = \cos(25\pi) + i\sin(25\pi) = -1.$$

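The advantage of the exponential form in Exercise 8.9 can be seen by comparing a direct power with the polar route. The sketch below uses only the standard `cmath` module; the variable names are illustrative.

```python
import cmath
import math

z = (1 + 1j) / math.sqrt(2)

# Polar form: modulus r and phase theta, so that z = r * exp(i * theta).
r, theta = cmath.polar(z)

# Raising to the 100th power multiplies the phase by 100 and raises r to the 100th power.
via_polar = cmath.rect(r ** 100, 100 * theta)

# Direct evaluation for comparison.
direct = z ** 100

print(r, theta / math.pi)  # modulus close to 1, phase close to 1/4 in units of pi
print(via_polar, direct)   # both close to -1, up to floating-point rounding
```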
Ex 8.10 The solution given in the main text can be
followed unchanged up to the point where the boundary
conditions are used. As in the main text, we can reject
the possibility that $\lambda = 0$, since this only gives the
trivial solution $y(x) = 0$ for all $x$.


For positive $\lambda$, we can set $\lambda = k^2$. Then the general
solution of the differential equation is

$$y(x) = A e^{kx} + B e^{-kx} \qquad \text{(Eqn 8.32)}$$
so
$$y'(x) = kA e^{kx} - kB e^{-kx}.$$
The boundary conditions $y(0) = y'(L) = 0$ give
$$A + B = 0 \quad \text{and} \quad kA e^{kL} - kB e^{-kL} = 0.$$

Rejecting the trivial solution $A = B = 0$, there is no
way of satisfying these equations since they imply that
$e^{2kL} = -1$, which is impossible. Hence $\lambda$ cannot be
positive.
For negative $\lambda$, we can set $\lambda = -k^2$, where $k$ can be
taken to be positive. Then the general solution of the
differential equation is

$$y(x) = C\cos(kx) + D\sin(kx), \qquad \text{(Eqn 8.29)}$$
so
$$y'(x) = -kC\sin(kx) + kD\cos(kx).$$
The boundary conditions $y(0) = y'(L) = 0$ give
$$C = 0 \quad \text{and} \quad kD\cos(kL) = 0.$$

Rejecting the trivial solution in which $C = D = 0$, we
see that $k$ must satisfy the condition $kL = (2n + 1)\pi/2$,
where $n = 0, 1, 2, \ldots$. It follows that the eigenvalues
are

$$\lambda = -k^2 = -\frac{(2n+1)^2\pi^2}{4L^2} \quad \text{for } n = 0, 1, 2, 3, \ldots,$$
with corresponding eigenfunctions

$$y_n(x) = D_n \sin\left(\frac{(2n+1)\pi x}{2L}\right).$$
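The eigenvalues and eigenfunctions found in Exercise 8.10 can be tested numerically. The sketch below assumes that the differential equation has the form $y'' = \lambda y$ (consistent with the general solutions quoted in the solution), takes $D_n = 1$, and uses an arbitrary length $L = 2$; the function names are hypothetical.

```python
import math

LENGTH = 2.0  # assumption: an arbitrary value of L for illustration


def eigenvalue(n):
    """lambda_n = -(2n + 1)^2 pi^2 / (4 L^2)."""
    return -((2 * n + 1) ** 2) * math.pi ** 2 / (4 * LENGTH ** 2)


def eigenfunction(n, x):
    """y_n(x) = sin((2n + 1) pi x / (2 L)), taking D_n = 1."""
    return math.sin((2 * n + 1) * math.pi * x / (2 * LENGTH))


def first_derivative(n, x):
    """Analytic derivative y_n'(x), used for the boundary condition at x = L."""
    k = (2 * n + 1) * math.pi / (2 * LENGTH)
    return k * math.cos(k * x)


def residual(n, x, h=1e-4):
    """Central-difference estimate of y'' - lambda * y, which should be close to zero."""
    second = (eigenfunction(n, x + h) - 2 * eigenfunction(n, x) + eigenfunction(n, x - h)) / h ** 2
    return second - eigenvalue(n) * eigenfunction(n, x)


for n in range(3):
    print(n, eigenvalue(n), eigenfunction(n, 0.0), first_derivative(n, LENGTH), residual(n, 0.7))
```

For each n, the value at x = 0 and the slope at x = L vanish to within rounding error, and the residual is small, limited by the finite-difference step.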


Ex 8.11 Using Equation 8.39, we obtain 


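$$\frac{\partial^2 f}{\partial x^2} = \frac{\partial}{\partial x}\left[k\cos(kx - \omega t)\right] = -k^2\sin(kx - \omega t),$$
$$\frac{\partial^2 f}{\partial t\,\partial x} = \frac{\partial}{\partial t}\left[k\cos(kx - \omega t)\right] = \omega k\sin(kx - \omega t).$$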


Similarly, Equation 8.40 gives 


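$$\frac{\partial^2 f}{\partial t^2} = \frac{\partial}{\partial t}\left[-\omega\cos(kx - \omega t)\right] = -\omega^2\sin(kx - \omega t),$$
$$\frac{\partial^2 f}{\partial x\,\partial t} = \frac{\partial}{\partial x}\left[-\omega\cos(kx - \omega t)\right] = k\omega\sin(kx - \omega t).$$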


The two mixed second-order partial derivatives are 
equal, as expected. 


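Ex 8.12 The function $f(x, t)$ is a function $g(z)$ of
another function $z = x - ct$. Using the normal rules of
calculus for differentiating such a composite function,
we obtain

$$\frac{\partial f}{\partial x} = \frac{dg}{dz}\,\frac{\partial z}{\partial x} = g'(x - ct),$$
$$\frac{\partial f}{\partial t} = \frac{dg}{dz}\,\frac{\partial z}{\partial t} = -c\,g'(x - ct),$$

where $g'(x - ct)$ is the derivative of the function $g(z)$,
evaluated at the point $z = x - ct$. Differentiating again,

$$\frac{\partial^2 f}{\partial x^2} = \frac{\partial}{\partial x}\left(\frac{\partial f}{\partial x}\right) = g''(x - ct),$$
$$\frac{\partial^2 f}{\partial t^2} = \frac{\partial}{\partial t}\left(\frac{\partial f}{\partial t}\right) = c^2 g''(x - ct),$$

where $g''(x - ct)$ is the second derivative of the
function $g(z)$, evaluated at the point $z = x - ct$. Hence

$$\frac{\partial^2 f}{\partial x^2} - \frac{1}{c^2}\frac{\partial^2 f}{\partial t^2} = g''(x - ct) - \frac{1}{c^2}\,c^2 g''(x - ct) = 0.$$

So any function of the form $g(x - ct)$ satisfies
Equation 8.43 (assuming that the function can be
differentiated twice).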
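The conclusion of Exercise 8.12 can be illustrated numerically for one particular choice of $g$. The sketch below assumes a Gaussian profile $g(z) = e^{-z^2}$ and a speed $c = 2$, both arbitrary choices made for illustration, and estimates the second derivatives by central differences.

```python
import math

SPEED = 2.0  # assumption: an arbitrary wave speed c


def g(z):
    """An arbitrary twice-differentiable profile (a Gaussian pulse)."""
    return math.exp(-z * z)


def f(x, t):
    """A travelling disturbance of the form g(x - ct)."""
    return g(x - SPEED * t)


def wave_residual(x, t, h=1e-3):
    """Central-difference estimate of d2f/dx2 - (1/c^2) d2f/dt2."""
    f_xx = (f(x + h, t) - 2 * f(x, t) + f(x - h, t)) / h ** 2
    f_tt = (f(x, t + h) - 2 * f(x, t) + f(x, t - h)) / h ** 2
    return f_xx - f_tt / SPEED ** 2


points = [(0.3, 0.1), (1.0, 0.5), (-0.7, 0.2)]
residuals = [wave_residual(x, t) for x, t in points]
print(residuals)  # each entry is close to zero, limited by the step size h
```

Replacing `g` by any other smooth function should leave the residual close to zero, which is the content of the exercise.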


Ex 8.13 Let the probability of outcome 1 be $p_1$.
Then the probability of outcome 2 is $p_2 = 3p_1$, and
the probability of outcome 3 is $p_3 = 6p_1$. So the
normalization rule gives

$$1 = p_1 + p_2 + p_3 = p_1 + 3p_1 + 6p_1 = 10p_1,$$
which gives $p_1 = 1/10$, $p_2 = 3/10$ and $p_3 = 6/10$.

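The normalization step in Exercise 8.13 amounts to dividing each relative weight by the sum of the weights. A minimal sketch using exact fractions (the list name `weights` is introduced here for illustration):

```python
from fractions import Fraction

# Relative weights of outcomes 1, 2 and 3 in units of p1: p2 = 3 p1 and p3 = 6 p1.
weights = [1, 3, 6]

total = sum(weights)  # normalization requires the probabilities to sum to 1
probabilities = [Fraction(w, total) for w in weights]

print(probabilities)       # p3 is shown in lowest terms, 3/5, which equals 6/10
print(sum(probabilities))
```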
Ex 8.14 (a) The text showed that the expectation
value of the score on a die is 3.5. Using Equation 8.54,
the square of the uncertainty of the score $S$ is

$$(\Delta S)^2 = \frac{1}{6}\left[(1 - 3.5)^2 + (2 - 3.5)^2 + (3 - 3.5)^2 + (4 - 3.5)^2 + (5 - 3.5)^2 + (6 - 3.5)^2\right] = \frac{35}{12},$$

so the uncertainty of the score is $\Delta S = \sqrt{35/12} = 1.7$.
(b) Using Equation 8.55 we get
$$(\Delta S)^2 = \frac{1}{6}\left(1^2 + 2^2 + 3^2 + 4^2 + 5^2 + 6^2\right) - 3.5^2 = \frac{35}{12},$$

which gives the same uncertainty as before:

$$\Delta S = \sqrt{35/12} = 1.7.$$
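Both routes to the uncertainty in Exercise 8.14 can be checked with exact arithmetic. The sketch below assumes a fair six-sided die; the first expression follows the mean-square-deviation form used in part (a) and the second follows the form used in part (b).

```python
import math
from fractions import Fraction

scores = range(1, 7)
p = Fraction(1, 6)  # assumption: a fair die, each score equally likely

mean = sum(p * s for s in scores)

# Part (a): average of the squared deviations from the mean.
variance_a = sum(p * (s - mean) ** 2 for s in scores)

# Part (b): mean of the squares minus the square of the mean.
variance_b = sum(p * s * s for s in scores) - mean ** 2

uncertainty = math.sqrt(variance_a)
print(mean, variance_a, variance_b, round(uncertainty, 1))
```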
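Complex numbers

$z = x + iy = r e^{i\theta}$     $z^* = x - iy = r e^{-i\theta}$     $|z|^2 = zz^* = x^2 + y^2 = r^2$
$\mathrm{Re}(z) = \dfrac{z + z^*}{2}$     $\mathrm{Im}(z) = \dfrac{z - z^*}{2i}$     $z^n = r^n e^{in\theta}$
$e^{i\theta} = \cos\theta + i\sin\theta$     $\sin\theta = \dfrac{e^{i\theta} - e^{-i\theta}}{2i}$
$e^{\pm i\pi} = -1$     $e^{-i\pi/2} = -i$


Elementary functions

$e^{x} = \cosh x + \sinh x$     $\cosh x = \dfrac{e^{x} + e^{-x}}{2}$     $\sinh x = \dfrac{e^{x} - e^{-x}}{2}$
$\ln a + \ln b = \ln(ab)$ $(a > 0,\ b > 0)$     $e^{\ln x} = \ln(e^{x}) = x$

$\cos(\theta + \pi) = -\cos\theta$     $\sin(\theta + \pi) = -\sin\theta$     $\tan(\theta + \pi) = \tan\theta$
$\sin(\theta + \pi/2) = \cos\theta$     $\tan(\theta + \pi/2) = -\cot\theta$
$\sin(\theta - \pi/2) = -\cos\theta$     $\tan(\theta - \pi/2) = -\cot\theta$

$\cos(2\theta) = \cos^2\theta - \sin^2\theta$     $\sin(2\theta) = 2\sin\theta\cos\theta$     $\tan(2\theta) = 2\tan\theta/(1 - \tan^2\theta)$

$\cos^2 A + \sin^2 A = 1$
$\sin(A \pm B) = \sin A\cos B \pm \cos A\sin B$     $\cos(A \pm B) = \cos A\cos B \mp \sin A\sin B$
$\sin A\sin B = \tfrac{1}{2}\left(\cos(A - B) - \cos(A + B)\right)$     $\cos A\cos B = \tfrac{1}{2}\left(\cos(A - B) + \cos(A + B)\right)$
$\sin A\cos B = \tfrac{1}{2}\left(\sin(A - B) + \sin(A + B)\right)$


Physical constants

Planck's constant              $h$                        $6.63 \times 10^{-34}$ J s
Planck's constant/$2\pi$       $\hbar$                    $1.06 \times 10^{-34}$ J s
vacuum speed of light          $c$                        $3.00 \times 10^{8}$ m s$^{-1}$
Coulomb law constant           $1/4\pi\varepsilon_0$      $8.99 \times 10^{9}$ m F$^{-1}$
permittivity of free space     $\varepsilon_0$            $8.85 \times 10^{-12}$ F m$^{-1}$
permeability of free space     $\mu_0$                    $4\pi \times 10^{-7}$ H m$^{-1}$
Boltzmann's constant                                      $1.38 \times 10^{-23}$ J K$^{-1}$
Avogadro's constant            $N_{\mathrm{m}}$           $6.02 \times 10^{23}$ mol$^{-1}$
electron charge                $-e$                       $-1.60 \times 10^{-19}$ C
proton charge                  $e$                        $1.60 \times 10^{-19}$ C
electron mass                  $m_{\mathrm{e}}$           $9.11 \times 10^{-31}$ kg
proton mass                    $m_{\mathrm{p}}$           $1.67 \times 10^{-27}$ kg
Bohr radius                    $a_0$                      $5.29 \times 10^{-11}$ m
atomic mass unit               u                          $1.66 \times 10^{-27}$ kg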

Basic equations 


operators =r 

Schrödinger's equation     $i\hbar\,\dfrac{\partial\Psi}{\partial t} = \hat{H}\Psi$
stationary state     $\Psi(x, t) = \psi_n(x)\,e^{-iE_n t/\hbar}$


time-independent S.E.     $\hat{H}\psi_n(x) = E_n\psi_n(x)$


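probability     $p_i = \left|\int \psi_i^*(x)\,\Psi(x, t)\,dx\right|^2$
expectation value     $\langle A\rangle = \sum_i p_i A_i$


uncertainty     $\Delta A = \left(\langle A^2\rangle - \langle A\rangle^2\right)^{1/2}$
Ehrenfest's theorem     $\dfrac{d\langle x\rangle}{dt} = \dfrac{\langle p_x\rangle}{m}$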


Boundary conditions 


$\psi(x)$ finite at $\pm\infty$, cts everywhere


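One-dimensional infinite square well ($n = 1, 2, \ldots$)


$E_n = \dfrac{n^2\pi^2\hbar^2}{2mL^2}$     $\psi_n(x) = \sqrt{\dfrac{2}{L}}\,\sin\left(\dfrac{n\pi x}{L}\right)$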


Harmonic oscillator (n = 0,1,2,...) 


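$E_n = \left(n + \tfrac{1}{2}\right)\hbar\omega_0$     $\omega_0 = \sqrt{C/m}$     $a = \sqrt{\hbar/m\omega_0}$
$\hat{A}^{\dagger}\psi_n(x) = \sqrt{n + 1}\,\psi_{n+1}(x)$     $\hat{A}\psi_n(x) = \sqrt{n}\,\psi_{n-1}(x)$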
ac Ob eet a ih „~ N 
ee) Pz Ta = ) 


Free particle, scattering and tunnelling 


probability density $= |\Psi(x, t)|^2$


wef. 


$\Delta x\,\Delta p_x \geq \hbar/2$


p(B) 


$d\psi/dx$ cts where $V(x)$ finite


$\langle A\rangle = \int \Psi^*(x, t)\,\hat{A}\,\Psi(x, t)\,dx$


; ) (symmetric well) 


Un(x) = [2 os (=) (n odd) 


y t) i(ka—E,t/h) dk =|. 0) eke q = hk 
P= =f Al ~ on a E á 
J lagoPak = (rx) = | hk|A(h) Pak 
$j_x(x, t) = -\dfrac{i\hbar}{2m}\left(\Psi^*\dfrac{\partial\Psi}{\partial x} - \Psi\dfrac{\partial\Psi^*}{\partial x}\right)$     $T = \dfrac{4k_1 k_2}{(k_1 + k_2)^2}$ (step)


Definite integrals for positive integers n and m 


$\int_{-a}^{a} f(x)\,dx = 0$ ($f(x)$ an odd function)     $\int_{-a}^{a} f(x)\,dx = 2\int_{0}^{a} f(x)\,dx$ ($f(x)$ an even function)
$\int_{0}^{\pi} \sin(nx)\sin(mx)\,dx = \dfrac{\pi}{2}\,\delta_{nm}$     $\int_{0}^{\pi} \cos(nx)\cos(mx)\,dx = \dfrac{\pi}{2}\,\delta_{nm}$


$\int_{-\pi/2}^{\pi/2} \sin(nx)\sin(mx)\,dx = \dfrac{\pi}{2}\,\delta_{nm}$ ($n + m$ even)     $\int_{-\pi/2}^{\pi/2} \cos(nx)\cos(mx)\,dx = \dfrac{\pi}{2}\,\delta_{nm}$ ($n + m$ even)


$\int_{0}^{n\pi} \cos^2 x\,dx = \dfrac{n\pi}{2}$     $\int_{0}^{n\pi} \sin^2 x\,dx = \dfrac{n\pi}{2}$
nT P R nT e 
| Par f vim gdr 
0 4 0 4 
$\int_{0}^{n\pi} x^2\cos^2 x\,dx = \dfrac{n^3\pi^3}{6} + \dfrac{n\pi}{4}$     $\int_{0}^{n\pi} x^2\sin^2 x\,dx = \dfrac{n^3\pi^3}{6} - \dfrac{n\pi}{4}$
$\int_{-n\pi/2}^{n\pi/2} \cos^2 x\,dx = \dfrac{n\pi}{2}$     $\int_{-n\pi/2}^{n\pi/2} \sin^2 x\,dx = \dfrac{n\pi}{2}$
nm /2 nen nr nn /2 nen nr 
i OS BER gg Jap” Gaa ag 


nn /2 neq nt /2 
| z’ cos z dz = a (ee — 1), (n odd) f z? cosa da = (—1)"/? 2rn, (n even) 
—nr/2 —nr/2 


$\int_{-\infty}^{\infty} e^{-x^2}\,dx = \sqrt{\pi}$     $\int_{0}^{\infty} x^n e^{-x}\,dx = n!$


(oe) ! 29 : 
f ieat (n 20) fot ae = yr (k real) 
A i 


co 


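$\int_{-\infty}^{\infty} x^{2n} e^{-x^2}\,dx = \dfrac{1 \times 3 \times \cdots \times (2n - 1)}{2^n}\,\sqrt{\pi}$ ($n \geq 1$)     $n! = 1 \times 2 \times \cdots \times n$     $0! = 1$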


